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1. Introduction 


We prepared this report to fulfill a request from the Florida Agency for Health Care Administration. This 
report contains three documents: 1. Main document (this document) summarizing the methodology 
used and the findings, 2. Methods document, which provides a detailed description of the systematic 
methodology used to find, prioritize, appraise, and synthesize the evidence, and 3. Results document, 
which describes the evidence available, the estimates of the effects of gender affirming therapies, and 
the certainty (also known as quality) of the evidence. 


This document is organized in four parts. First, we describe the credentials and expertise of the health 
research methodologists conducting this evidence evaluation. Second, we summarize the methodology 
used. Third, we summarize the main findings. Finally, we briefly discuss strengths and limitations of our 
process and of the evidence. 


2. Credentials and expertise 


Two experts in health research methodology, who specialize in evidence synthesis to support decision 
making, prepared this report. Their relevant credentials and expertise are described below. 


Dr. Romina Brignardello-Petersen: Assistant Professor at the Department of Health Research Methods, 
Evidence, and Impact, at McMaster University. Dr. Brignardello-Petersen obtained a DDS degree 
(University of Chile) in 2007, an MSc degree in Clinical Epidemiology and Health Care Research 
(University of Toronto) in 2012, and MSc in Biostatistics (University of Chile) in 2015, and a PhD in 
Clinical Epidemiology and Health Care Research (University of Toronto) in 2016. Dr. Brignardello- 
Petersen has worked in evidence synthesis projects since 2010, and her research has focused on the 
methodology for the development of Systematic Reviews and Clinical Practice Guidelines since 2012. 
Through January 2022, she has published 122 peer reviewed scientific articles (24 as a first author and 9 
as a senior author). Dr. Brignardello-Petersen has acted as a research methodologist for several groups 
and organizations, including the World Health Organization, the Pan-American Health Organization, the 
American Society of Hematologists, the American College of Rheumatology, and the Society for Evidence 
Based Gender Medicine, among others. Her research program has been awarded over $2M CAD from 
the Canadian Institutes for Health Research. Dr. Brignardello-Petersen has no lived experience as a 
person or family member of a person with gender dysphoria, and her research interests are not in this 
area. 


Dr. Wojtek Wiercioch: Postdoctoral Research Fellow at the Department of Health Research Methods, 
Evidence, and Impact, at McMaster University. Dr. Wiercioch obtained an MSc degree (2014, McMaster 
University) and a PhD degree (2020, McMaster University) in Health Research Methodology. Dr. 
Wiercioch has worked in evidence syntheses projects since 2011, and his research focuses on evidence 
synthesis, guideline development methodology, and the guideline development process. Through April 
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2022, he has published 86 peer-reviewed scientific articles. Dr. Wiercioch has acted as a guideline 
methodologist for several groups and organizations, including the World Health Organization, the 
American Society of Hematologists, the Endocrine Society (of America), and the American Association 
for Thoracic Surgeons, among others. Dr. Wiercioch has no lived experience as a person or family 
member of a person with gender dysphoria, and his research interests are not in this area. 


3. Methods 


We conducted an overview of systematic reviews. We used a reproducible approach to search, select, 
prioritize, appraise, and synthesize the available evidence, following high methodological standards. We 
describe full details of the methodology in an accompanying document. 


In brief, we searched for systematic reviews published in English language in Epistemonikos, OVID 
Medline, and grey literature sources, through April 30, 2022. We selected systematic reviews which 
included studies on young individuals with a diagnosis of gender dysphoria, who received puberty 
blockers, cross-sex hormones, or surgeries; and in which authors reported data regarding outcomes 
important to patients: gender dysphoria, depression, anxiety, quality of life, suicidal ideation, suicide, 
adverse effects, and complications. Systematic reviews could have included any type of primary study 
design. 


The two reviewers screened all titles and abstracts, followed by full text of potentially relevant 
systematic reviews. We then prioritized the most useful systematic review providing evidence for each 
of the outcomes, using pre-established criteria that considered date of publication, applicability, 
availability of outcome data, methodological quality of the systematic review, and usefulness of the data 
synthesis conducted in the systematic review (see methods document for details). 


After abstracting data from the systematic reviews, we synthesized the best available evidence for each 
of the outcomes, and assessed the certainty (also known as quality) of the evidence using the Grading of 
Recommendations Assessment, Development, and Evaluation (GRADE) approach. We conducted GRADE 
assessments using the information provided by the systematic review authors (risk of bias of primary 
studies, characteristics of included studies, results reported by the studies). We present the all the 
information about outcomes in GRADE summary of findings tables. 


In addition, to evaluate the robustness of our conclusions, we systematically searched for and evaluated 
primary studies answering the questions of interest published after the authors of the included 
systematic reviews conducted their searches. 


4. Results 


We included 61 systematic reviews, from which 3 addressed the effects of puberty blockers, 22 
addressed the effects of cross-sex hormones, 30 addressed the effects of surgeries, and 6 addressed the 
effects of more than one of these interventions. After our prioritization exercise, we included 
information from 2 systematic reviews on puberty blockers, 4 on cross-sex hormones, and 8 on 
surgeries. 


4.1 Puberty blockers 
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For most outcomes (except suicidality), there is no evidence about the effect of puberty blockers 
compared to not using puberty blockers. In other words, no studies compared the outcomes between a 
group of people with gender dysphoria using puberty blockers and another group of people with gender 
dysphoria not using them. Therefore, it is unknown whether people with gender dysphoria who use 
puberty blockers experience more improvement in gender dysphoria, depression, anxiety, and quality of 
life than those with gender dysphoria who do not use them. There is very low certainty about the effects 
of puberty blockers on suicidal ideation. 


The studies included in the systematic review reported outcomes among a group of people with gender 
dysphoria after receiving puberty blockers. Low certainty evidence suggests that after treatment with 
puberty blockers, people with gender dysphoria experience a slight increase in gender dysphoria, and an 
improvement in depression, and anxiety. Low certainty evidence also suggests that a moderate 
percentage of patients experience adverse effects. The findings must be interpreted considering that 
these studies did not have a comparison group, and that it is unknown if people with gender dysphoria 
that do not use puberty blockers experience similar or different outcomes. 


4.2 Cross sex hormones 

For almost all outcomes (except breast cancer) there is no evidence about the effect of cross sex 
hormones compared to not using cross sex hormones. In other words, no studies compared the 
outcomes between a group of people with gender dysphoria using cross sex hormones and another 
group of people with gender dysphoria not using them. Therefore, it is unknown whether people with 
gender dysphoria who use cross-sex hormones experience more improvement in gender dysphoria, 
depression, anxiety, quality of life, and suicidality than those with gender dysphoria who do not use 
cross-sex hormones. There is low certainty evidence suggesting that cross-sex hormones may not 
increase the risk of breast cancer. 


The studies included in the systematic reviews reported changes in the outcomes among a group of 
patients with gender dysphoria after the use of cross-sex hormones. Low certainty evidence suggests 
that after treatment with cross-sex hormones, people with gender dysphoria experience an 
improvement in gender dysphoria, depression, anxiety, and suicidality. There is very low certainty 
evidence about the changes in quality of life. There is moderate certainty evidence suggesting a low 
prevalence of venous thromboembolism after treatment with cross-sex hormones. The findings must be 
interpreted considering that these studies did not have a comparison group, and that it is unknown if 
people with gender dysphoria that do not use cross-sex hormones experience similar or different 
outcomes. 


4.3 Surgeries 

There were no systematic reviews and studies reporting on gender dysphoria, depression, anxiety, and 
suicidality. Therefore, the effects of surgeries on these outcomes (when compared to a group of patients 
with gender dysphoria who do not undergo surgery), or the changes in these outcomes (improvements 
or deterioration) among patients who undergo any gender-affirming surgery is unknown. Because of the 
lack of comparative studies, it is also unknown whether people with gender dysphoria who undergo 
surgeries experience more improvement in quality of life or less regret than those with gender 
dysphoria who do not undergo any surgeries. There is low certainty evidence suggesting that a low 
percentage of participants experience regret, and very low certainty evidence about changes in quality 
of life after surgery. 
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In assigned females at birth, low certainty evidence suggests that a high percentage of people are 
satisfied after chest surgery. There is very low certainty evidence, however, about satisfaction after 
bottom surgery, and about complications after both chest and bottom surgery. In assigned males at 
birth, low certainty evidence suggests a high percentage of people satisfied and a low percentage of 
people experiencing regret after vaginoplasty. There is very low certainty, however, about satisfaction 
with chest surgery and complications and reoperations after bottom surgery. 


4.4 Evidence published after the systematic reviews selected 
We found 10 relevant studies that were published after the systematic reviews were conducted. This 
evidence was not sufficient to importantly change the conclusions previously made. 


5. Discussion 


5.1 Summary of the evidence 

In this report, we systematically summarized the best available evidence regarding the effects of 
puberty blockers, cross-sex hormones, and surgeries in young people with gender dysphoria. We did not 
find evidence about the effect of these interventions on outcomes important to patients when 
compared to not receiving the intervention. We found low and very low certainty evidence suggesting 
improvements in gender dysphoria, depression, anxiety, and quality of life, as well as low rates of 
adverse events, after treatment with puberty blockers and cross-sex hormones. 


5.2 Completeness and applicability 

There are several gaps in the evidence regarding the effects of puberty blockers, cross-sex hormones, 
and surgeries in patients with gender dysphoria. Although we found some evidence for all the outcomes 
of interest, the evidence is suboptimal: several limitations included the lack of studies with a comparison 
group, and the risk of bias and imprecision, resulting in low or very low certainty evidence for all 
outcomes. 


The applicability of the evidence may also be limited. Although we only rated down for indirectness 
when it was considered a serious problem (i.e., in evidence about the effects of surgeries, which was 
collected from people who were importantly older than the target population in this report), there are 
also potential applicability issues to consider in the evidence regarding the effects of puberty blockers 
and cross-sex hormones. It is not clear to what extent the people included in the studies were similar 
enough to the people seeking these treatment options today. For example, some of the included studies 
were conducted in people who had a diagnosis of gender dysphoria confirmed with strict criteria, as well 
as a supportive environment. It is important to take into account to what extent this may compromise 
the applicability of the results to people who are not in the same situation. 


5.3 Strengths and limitations of the process for developing this report 

We followed a reproducible process for developing this report. We used the highest methodological 
standards and the approach to evidence synthesis we generally use when supporting organizations in 
the development of their guidelines. This approach is based on prioritizing the sources of evidence most 
likely to be informative (i.e., to identify and use the evidence with the highest certainty level). 


To follow the principles for evidence-based decision making, which require using the best available 
evidence to inform decisions, we summarized the best available evidence. Because knowing the best 
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available evidence necessitates being aware of all the available evidence, we based this report on 
systematic reviews of the literature. We chose the most trustworthy and relevant systematic reviews 
among many published reviews. 


One potential limitation of the process is that, due to feasibility concerns, we relied on the information 
reported by the systematic reviewers. Most of the systematic reviews we used, unfortunately, were 
judged at moderate or low methodological quality, which may raise concerns about the trustworthiness 
of the evidence presented in this report. We believe, however, that the results and conclusions of this 
report would not be importantly different had the systematic reviews been conducted following higher 
methodological standards. Because there are no randomized controlled trials, well-conducted 
comparative observational studies, or very large case series (which include a large sample of consecutive 
patients who are representative of the whole population) addressing the effects of puberty blockers, 
cross-sex hormones, and surgeries; the certainty of the evidence about the effects of these 
interventions is likely to continue being low or very low, even if a few more studies are included (as 
observed after searching for primary studies published after the reviews were conducted) or some data 
points were reported inaccurately in the systematic reviews. 


Also due to feasibility concerns, the scope of this report was limited to outcomes that are important to 
patients. Although some may question the decision of not including surrogate outcomes for which there 
is evidence available (e.g. bone density, blood pressure), decision makers should rarely consider these 
outcomes and should instead focus on outcomes that do matter to people and stakeholders (e.g., 
fractures, cardiovascular events). 


5.4 Implications 

The evidence evaluating the effects of puberty blockers, cross-sex hormones, and surgeries in people 
with gender dysphoria has important limitations. Therefore, decisions regarding their use should 
carefully consider other relevant factors. At a patient level, these factors include patients’ values and 
preferences (how patients trade off the potential benefit and harms - what outcomes are more 
important to them), and resources needed to provide the interventions (and the availability of such 
resources). At a population level, in addition to these factors, it would be important to consider 
resources needed to implement the interventions, feasibility, acceptability by relevant stakeholders, and 


equity. 


It is important to note that when there is low or very low certainty evidence, it is rarely appropriate to 
make decisions that will be applied to the majority of the patients (equivalent to strong 
recommendations). This implies, at the patient level, that shared decision making is a key part of the 
decision-making process. At a policy level, extensive debate may be needed. 


6. Conclusions 


Due to the important limitations in the body of evidence, there is great uncertainty about the effects of 
puberty blockers, cross-sex hormones, and surgeries in young people with gender dysphoria. This 
evidence alone is not sufficient to support whether using or not using these treatments. We encourage 
decision makers to be explicit and transparent about which factors play an important role in their 
decision, and how they are weighed and traded off. 
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Methods 


To ensure completeness and feasibility of the evidence review, we used an approach in which 
we prioritized the types of studies according to the design that was more likely to provide the 
best available evidence. First, we searched for systematic reviews of the literature. Second, we 
appraised all existing systematic reviews to select the most trustworthy (highest 
methodological quality, most up-to-date, most applicable) from which to draw conclusions. 
Third, we used the information presented in the systematic reviews to abstract information 
regarding the effects of the interventions of interest. Fourth, we assessed the certainty of the 
evidence (also known as quality of the evidence) abstracted from the selected systematic 
reviews. We planned to search for primary studies if systematic reviews were not found. 


Information sources: We searched for existing systematic reviews in: 

1. Epistemonikos (https://www.epistemonikos.org), an electronic database that focuses on 
systematic reviews. We used a comprehensive search strategy based on the population, 
using the terms “gender dysphoria”, “gender identity disorder” and “transgender”. We 
conducted this search on April 23, 2022. 

2. OVID Medline. We used a search strategy based on the population and the interventions 
of interest, as well as an adaptation of a filter for systematic reviews from the Health 
Information Research Unit at McMaster University. We conducted this search on April 
23, 2022. 

3. Grey literature: we conducted a manual search in the websites of specific health 
agencies: National Institutes for Health and Care Excellence (NICE), Agency for 
Healthcare Research and Quality (AHRQ), Canada’s Drug and Health Technology Agency 
(CADTH), and the website from the Society for Evidence-Based Gender Medicine 
(SEGM). We conducted these searches between April 27-30, 2022. 


We used no date limits for the searches, but we did limit to systematic reviews published in 
English. Search strategies are available in Appendix 1. 


Eligibility criteria: We included systematic reviews, which we defined as: 

1. Reviews in which the authors searched for studies to include in at least one electronic 
database, and in which there were eligibility criteria for including studies and a 
methodology for assessing and synthesizing the evidence, or 

2. Reviews in which the authors searched for studies to include in at least one electronic 
database, and although there was no description of eligibility criteria or methodology, 
the presentation of the results strongly suggested that the authors used systematic 
methods (e.g. flow chart depicting study selection, tables with the same information 
from all included studies, synthesis of data at the outcome level). 


We screened systematic reviews using the following criteria for inclusion: 
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oO. Type of participants: Young individuals (< 25 years old) with a diagnosis of gender 
dysphoria/gender identity disorder. We included reviews in which authors used any label 
and diagnostic criteria for this condition. We included reviews in which the participants in 
the reported studies were older if it was the only evidence available for a specific question. 
We excluded reviews with mixed populations (i.e. with and without gender dysphoria) in 
which people without gender dysphoria constituted more than 20% of the total sample. 


© Type of Interventions: Puberty blockers, cross-sex hormones, gender affirming surgeries. We 
included any type of puberty blockers and cross-sex hormones, provided with any regimen. 
We included the following surgeries: phalloplasty, vaginoplasty, and chest surgery 
(mastectomy or breast implants/augmentation). We only included these when they were 
performed for the first time (i.e., not revision surgeries). 


o Type of comparison: When the systematic reviews included comparative studies, the 
comparator of interest was no intervention. Participants could have received psychotherapy 
or counselling as a cointervention (in both groups). 


© Type of outcomes: Gender dysphoria, mental health outcomes (depression and anxiety), 
quality of life, suicidal ideation, suicide, adverse effects (for puberty blockers and cross-sex 
hormones only), and satisfaction, complications, reoperation, and regret (for surgeries only). 
We included any length of follow-up. We excluded surrogate outcomes such as blood 
pressure, bone mineral density, kidney or liver function test values, etc. 


oO. Type of studies included in the systematic reviews: Any clinical study (studies in which the 
researchers recruited and measured outcomes in humans) regardless of study design. This 
included randomized clinical trials, comparative observational studies, and case series. 
Because we could not quantify effect measures, incidence, or prevalence, we excluded case 
reports. 


We excluded systematic reviews published only in abstract format, and those that we could not 
retrieve in full text (no access through the McMaster University library, or open access online). 


Selection process: The two reviewers screened all titles and abstracts independently and in 
duplicate, followed by screening of full texts of potentially eligible systematic reviews 
independently and in duplicate, using the systematic review online application Covidence 
(https://www.covidence.org). We solved disagreements by consensus. 


To select the most useful systematic reviews among all of those that met the eligibility criteria, 
we used the following prioritization criteria: 


1. Date of publication: we prioritized systematic reviews published within the last 3 years 
(2020-2022) 
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2. Match between eligibility criteria of the review and the question of interest: we 
prioritized reviews in which the authors specifically included the population, 
intervention, comparison, and outcomes of interest for this evidence review 

3. Outcome data available: we prioritized systematic reviews in which the authors report 


outcome data 


4. Methodological quality: we used a modified version of the items in AMSTAR 2.1 We 
modified the items to ensure assessment of methodological rather than reporting 
quality (Table 1). We rated each systematic review as having high, moderate, low, or 
critically low methodological quality, according to the guidance from the developers of 


the tool.’ We reached consensus on critical items that determined this rating (Table 1). 
We prioritized selection of systematic reviews with highest methodological quality. 


For surgical interventions, in addition, we prioritized systematic reviews that covered all gender 
affirming surgeries (instead of focusing on a specific type of surgery). 


We selected a systematic review specifically for each of the outcomes of interest. In other 
words, we chose the best systematic review to inform each outcome. Each systematic review, 


however, could inform more than one outcome. 


Table 1: Items used to rate the methodological quality of the eligible systematic reviews 


AMSTAR Item 


Modification to measure methodological 
quality 


1. Did the research questions and inclusion criteria for 
the review include the components of PICO? 


Does the review have a clear question and are the 
eligibility criteria for studies consistent with the 


question? 


2. Did the report of the review contain an explicit 
statement that the review methods were established 
prior to the conduct of the review and did the report 
justify any significant deviations from the protocol? 


No modification needed 


3. Did the review authors explain their selection of 
the study designs for inclusion in the review? 


No modification needed 


4. Did the review authors use a comprehensive 
literature search strategy? 


Did the authors search in at least 2 electronic 
databases, using a reproducible search strategy? 


5. Did the review authors perform study selection in 


duplicate? No modification needed 
6. Did the review authors perform data extraction in 
duplicate? No modification needed 


7. Did the review authors provide a list of excluded 
studies and justify the exclusions? 


No modification needed 


8. Did the review authors describe the included 
studies in adequate detail? 


No modification needed 


9. Did the review authors use a satisfactory technique 
for assessing the risk of bias (RoB) in individual studies 
that were included in the review? 


No modification needed 
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Did the review authors consider conflicts of interest 


10. Did the review authors report on the sources of and how they may have affected the results of the 
funding for the studies included in the review? primary studies? 

11. If meta-analysis was performed, did the review Was the synthesis of evidence done appropriately? 
authors use appropriate methods for statistical (outcome level, appropriate meta analysis or narrative 
combination of results? synthesis) 

12. If meta-analysis was performed, did the review 

authors assess the potential impact of RoB in Did authors use subgroup or sensitivity analysis to 
individual studies on the results of the meta-analysis assess the effect of risk of bias in meta-analytic 

or other evidence synthesis? results? Likely not applicable to most cases 


13. Did the review authors account for RoB in primary | Did the review authors incorporate an assessment of 
studies when interpreting/discussing the results of the | risk of bias at the outcome level when drawing 


review? conclusions? 

14. Did the review authors provide a satisfactory Did the review authors incorporate an assessment of 
explanation for, and discussion of, any heterogeneity | heterogeneity at the outcome level when drawing 
observed in the results of the review? conclusions? 


——| 


15. If they performed quantitative synthesis did the 
review authors carry out an adequate investigation of | Did the authors address publication bias? (regardless 
publication bias (small study bias) and discuss its likely | of whether synthesis was using a meta-analysis or 


impact on the results of the review? narrative) 

16. Did the review authors report any potential 

sources Did the authors report conflicts of interest and did 
of conflict of interest, including any funding they they manage any existing conflict of interest 
received for conducting the review? appropriately? 


Shaded items were items considered critical. 


Data abstraction: We abstracted outcome data from each of the systematic reviews. To ensure 
feasibility, we used the data as reported by the authors of the review and did not re-abstract 
data from the primary studies. One reviewer abstracted the data and a second reviewer 
checked the data for accuracy. 


Data synthesis: Using the systematic reviews prioritized, we synthesized the evidence at the 
outcome level. Because of the higher likelihood of it resulting in higher certainty of evidence 
(details below) for each outcome, when there was comparative data (i.e. comparison of 
outcomes between an untreated and a treated group) and non-comparative data (i.e. changes 
from before to after treatment in one group, or only outcomes after treatment), we prioritized 
comparative data. 


We prioritized numerical results (i.e. magnitudes of effect) and reported estimates and their 
95% confidence intervals (Cls). When results were not reported in that way, we calculated the 
estimates and Cls when systematic review authors provided sufficient information. When 
necessary, we assumed moderate correlation coefficients for the changes between baseline 
and follow up (coefficient= 0.4). When this information was not available we reported 
narratively the effect estimates and ranges. 


When a specific study reported the same outcome measured by more than one scale, we chose 
the scale presented first. We highlighted situations when the results obtained with other scales 
were importantly different. 
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When the same outcome was reported by more than one study but we could not pool the 
results, we created narrative syntheses. 


Certainty of evidence: For each outcome, we assessed the certainty of the evidence (also 
known as quality of the evidence) using the Grading of Recommendations Assessment, 
Development, and Evaluation (GRADE) approach. The certainty of evidence can be rated as 
high, moderate, low, or very low (Table 2). For effects of interventions, the certainty of the 
evidence started as high and could be rated down due to serious concerns about risk of bias, 
inconsistency, indirectness, imprecision, and publication bias. For inferences about the effect of 
using a treatment versus no treatment, when there was no comparison group, we assessed risk 
of bias as very serious and rated down the certainty of the evidence 2 levels by default. We 
used the same principles when assessing the certainty of the evidence in estimates of 
prevalence or rates, but did not judge risk of bias as resulting in very serious concerns due to 
lack of a comparison group. For all assessments, we used the information presented by the 
authors of the systematic review (e.g. assessments of risk of bias of the included studies, effect 
estimates from studies). 


Table 2: GRADE levels of certainty of the evidence 


Certainty level Definition 

High We are very confident that the true result (effect estimate/ prevalence/ 

®SOe mean, etc.) lies close to that of the estimate of the result 

Moderate We are moderately confident in the result: the true result is likely to be 

®86O0 close to the estimate of the result, bur there is a possibility that it is 
substantially different 

Low Our confidence in the result is limited: the true result may be 

®80OO substantially different from the estimate of the result 

Very low We have very little confidence in the result: the true result is likely to 

®OO0O be substantially different from the estimate of the result 


Presentation of results: We created GRADE Summary of Findings tables in which we describe 
the evidence available for each of the outcomes, and the certainty of the evidence. These 
tables contain the following information: 


- Outcomes: measurement method (including scales, if applicable) and follow-up 

- Estimates of effect: absolute and relative estimates of effect, and their corresponding 
95% Cls. 

- Number of studies and participants providing evidence for the outcome 

- GRADE certainty of the evidence, with a link to detailed explanations (provided at the 
bottom of the table) of why the certainty of the evidence was rated at a specific level 

- Anarrative statement about what happens with the outcome, based on the estimate of 
effect and certainty of evidence. 
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Searching for new evidence not included in the systematic reviews: To assess if newer 
evidence not included in the included systematic reviews would change the conclusions 
importantly, we searched for and assessed primary studies answering the questions of interest 
that were published after the authors of such systematic reviews conducted their searches. We 
defined an important change in conclusions as a change in the certainty of the evidence (from 
low/ very low/ not available to high/ moderate). 


We searched OVID Medline from January 1, 2019 through May 12, 2022, for studies published 
in English. We included studies if they enrolled young individuals (< 25 years old, with at least 
20% of the people being this age) with a diagnosis of gender dysphoria/gender identity 
disorder, who received puberty blockers, cross-sex hormones, or surgeries; and measured any 
of the outcomes of interest. 


For outcomes that should be evaluated in a comparative manner (e.g., depression, anxiety, 
etc.), because they are the only type of study design that would change the conclusions 
importantly, we selected comparative clinical studies (studies in which the researchers 
recruited and measured outcomes in humans, and compared a group of people who received 
the intervention with another one who did not receive the intervention). This included 
randomized clinical trials, and comparative observational studies. For outcomes that can only 
occur when the treatment is administered, we included non-comparative observational studies 
(case series). For these to change conclusions, they should have a sufficiently large sample size, 
and therefore we excluded case series in which the researchers reported information from 
<100 people. 


Two reviewers screened the potentially relevant articles at title and abstract and full text 
screening stage. We abstracted relevant study characteristics and outcome data, and assessed 
risk of bias of comparative studies using the most relevant domains of the Risk of Bias for non- 
Randomized studies of Interventions (ROBINS-I) tool? (table 3). For non-comparative studies, 
we used a list of custom items that captured the most important potential risk of bias concerns 
of case series (table 4). We judged the risk of bias of each study as the highest risk of bias of any 
of the domains assessed (e.g., one domain judged at critical risk of bias resulted in the study 
judged at critical risk of bias). We summarized this information at the study and judged whether 
it would have changed the conclusions importantly if added to the body of evidence from the 
systematic reviews. 


Table 3: Domains used to assess risk of bias of comparative studies 


Domain Low Critical 

Adjusted for all relevant confounding 
Confounding factors No adjustment 
Classification of Intervention recorded prospectively or Asked patients to recall whether they 
intervention from medical records received the intervention 
Deviation from intended No cointerventions or cointerventions Cointerventions unbalanced between the 
interventions balanced between the groups groups 
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evidence. Dr. Romina Brignardello-Petersen and Dr. Wojtek Wiercioch; Methods; May 16, 2022 


More than 90% of patients who started Less thank 50% of patients who started 


Missing data the study provided outcome data the study provided outcome data 
Measurement of All outcomes measured in the same way Outcomes measured differently in both 
outcome in both groups groups 


Each domain could be judged at low, moderate, serious, or critical risk of bias. In addition, information could be 
insufficient to make a judgment. The table describes the criteria used to judge a domain in the extreme categories. 


Table 4: Domains used to assess risk of bias of non- comparative studies 


Domain Low High 
Highly selected sample based on specific 
Representativeness of characteristics related with the prognosis 
the sample Included all consecutive patients after treatment 
Classification of the Intervention recorded prospectively or Asked patients to recall whether they 
intervention from medical records received the intervention 
No cointerventions outside what would be 
Deviation from intended observed in practice (or in a small Most patients received co interventions 
interventions proportion of patients) that could influence the outcomes 
More than 90% of patients who started the Less thank 50% of patients who started 
Missing data study provided outcome data the study provided outcome data 
Outcomes reported by the patients and/ 
Measurement of Outcomes measured prospectively or from or needed to recall what happened a long 
outcome medical records time ago 


Each domain could be judged at low, moderate, or high risk of bias. In addition, information could be insufficient to 
make a judgment. The table describes the criteria used to judge a domain in the extreme categories. 
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Search Strategies 
Questions Covered: 
PICO questions: 
1. For children, adolescents, and young adults (<21) with gender dysphoria, what are the effects of 
treatment with puberty blockers (gonadotrophin releasing hormone (GnRH) analogues) 


compared to no puberty blockers? 


2. For children, adolescents, and young adults (<21) with gender dysphoria, what are the effects of 
treatment with cross-sex hormones compared to no cross-sex hormones? 


3. For children, adolescents, and young adults (<21) with gender dysphoria, what are the effects of 
gender-affirming surgeries compared to no surgery? 


Search Strategies: 
Note: Population, puberty blocker, cross-sex hormones search blocks adapted from NICE (2020) 
evidence reviews. Gender-affirming search block adapted from Wernick et al. 2019. Systematic reviews 


filter adapted from McMaster University Health Information Research Unit (HIRU). 


Databases: Medline, Epistemonikos 
Grey Literature: CADTH, AHRQ, SEGM, NICE 


Medline 
OVERVIEW 
Interface: Ovid 
Databases: OVID Medline Epub Ahead of Print, In-Process & Other Non-Indexed Citations, 


Ovid MEDLINE(R) Daily and Ovid MEDLINE(R) 1946 to Present 
Study Types: Systematic Reviews 
Search Run: April 23, 2022 


Search Strategy: search terms [number of results] 


Population 


1 exp "Sexual and Gender Minorities"/ 12385 

2 Gender Dysphoria/ 774 

3 Gender Identity/ 20481 

4 GenderRole/ 197 

5 "Sexual and Gender Disorders"/ 81 

6 Transsexualism/ 4236 

7 Transgender Persons/ 5303 

8 Health Services for Transgender Persons/ 186 


13 


exp Sex Reassignment Procedures/ 1208 

gender identity disorder.mp. 492 

non-binary.mp. 566 

transgender.mp. 9989 

(gender* adj3 (dysphori* or disorder* or distress or nonconform* or non-conform* or 


atypical or incongru* or identi* or disorder* or confus* or minorit* or queer* or variant or 
diverse or creative or explor* or question* or expan* or fluid)).tw. 16428 


14 


((sex or gender*) adj3 (reassign* or chang* or transform* or transition* or 


expression*)).tw. 13749 


15 (transgend* or transex* or transsex* or transfem* or transwom* or transma* or 
transmen* or transperson* or transpeopl*).tw. 19665 
16 (genderfluid or genderqueer or agender).mp. 130 


17 
18 


19 


((correct or chosen) adj3 name).mp. 591 

(trans or crossgender* or cross-gender* or crossex* or cross-sex* or genderqueer*).tw. 
135313 

((sex or gender*) adj3 (reassign* or chang* or transform* or transition* or 


expression*)).tw. 13749 


20 
21 


(male-to-female or m2f or female-to-male or f2m).tw. 148579 
or/1-20 342948 


Cross-Sex Hormones 


22 
23 
24 
25 
26 
27 


Hormones/ad, tu,th 4676 

exp Progesterone/ad, tu, th 11265 

exp Estrogens/ad, tu, th29635 

exp Gonadal Steroid Hormones/ad, tu, th 35375 

(progesteron* or oestrogen* or estrogen*).tw. 223307 

((cross-sex or crosssex or gender-affirm*) and (hormon* or steroid* or therap* or 


treatment* or prescri* or pharm* or medici* or drug* or intervention* or care)).tw. 1488 


28 exp Estradiol/ad, tu, th 11197 

29 exp Testosterone/ad, tu, th 8710 

30 (testosteron* or sustanon* or tostran or testogel or testim or restandol or andriol or 
testocaps* or nebido or testavan).tw. 86509 


31 


(oestrad* or estrad* or evorel or ethinyloestrad* or ethinylestrad* or elleste or progynova 


or zumenon or bedol or femseven or nuvelle).tw. 100252 


32 or/22-31 345895 
Puberty Blockers 
33 Gonadotropin-Releasing Hormone/ 28809 
34 (pubert* adj3 block*).ti,ab. 141 
35 ((gonadotrophin or gonadotropin) and releasing).ti,ab. 20121 


36 
a? 
38 
39 
40 
41 


(GnRH adj2 analog*).ti,ab. 2878 
GnRH*.ti,ab. 24390 

"GnRH agonist*".ti,ab. 4749 
Triptorelin Pamoate/ 1981 
triptorelin.ti,ab.821 

arvekap.ti,ab. 1 


("AY 25650" or AY25650).ti,ab. 1 

("BIM 21003" or BIM21003).ti,ab. 0 
("BN 52014" or BN52014).ti,ab. 0 

("CL 118532" or CL118532).ti,ab. 0 
Debio.ti,ab. 119 

diphereline.ti,ab. 28 

moapar.ti,ab. 0 

pamorelin.ti,ab.1 

trelstar.tiab. 3 

triptodur.ti,ab. 1 


("WY 42422" or WY42422).ti,ab. 0 
("WY 42462" or WY42462).ti,ab. 0 
gonapeptyl.ti,ab. 0 
decapeptyl.ti,ab. 225 


salvacyl.tiab. 0 

Buserelin/ 2137 
buserelin.ti,ab. 1395 

onist.ti,ab. 0 

("hoe 766" or hoe-766 or hoe766).ti,ab. 72 
profact.ti,ab. 2 

receptal.ti,ab. 31 
suprecur.ti,ab. 5 

suprefact.ti,ab. 25 

tiloryth.ti,ab. 0 

histrelin.ti,ab. 78 
"LHRH-hydrogel implant".ti,ab. 1 
("RL 0903" or RLO903).ti,ab. 1 
("SPD 424" or SPD424).ti,ab. 1 
goserelin.ti,ab. 1016 

Goserelin/ 1643 

("ici 118630" or ici118630).ti,ab. 51 
("ZD-9393" or ZD9393).ti,ab. 0 
zoladex.ti,ab. 388 
leuprorelin.ti,ab. 525 
carcinil.ti,ab. 0 

enanton*.ti,ab. 26 

ginecrin.ti,ab. 0 

leuplin.tiab. 15 

Leuprolide/ 3018 
leuprolide.ti,ab. 2004 

lucrin.ti,ab. 16 

lupron.ti,ab. 183 
provren.ti,ab. 0 

procrin.ti,ab. 3 

("tap 144" or tap144).ti,ab. 41 


(a-43818 or a43818).ti,ab. 3 
Trenantone.ti,ab. 2 
staladex.ti,ab. 0 


90 prostap.ti,ab. 6 

91 Nafarelin/ 327 

92. nafarelin.ti,ab. 263 

93 ("76932-56-4" or "76932564").ti,ab. 
94 ("76932-60-0" or "76932600").ti,ab. 
95 ("86220-42-0" or "86220420").ti,ab. 
96 ("rs 94991 298" or rs94991298).ti,ab. 
97 synarel.tiab. 13 

98 deslorelin.ti,ab. 306 

99 gonadorelin.ti,ab. 237 

100 ("33515-09-2" or "33515092").ti,ab. 
101 ("51952-41-1" or "51952411").ti,ab. 
102 ("52699-48-6" or "52699486").ti,ab. 
103 cetrorelix.ti,ab. 520 

104 cetrotide.ti,ab. 52 

105 ("NS 75A" or NS75A).ti,ab. 0 
106 ("NS 75B" or NS75B).ti,ab. 0 
107 ("SB 075" or SBO75).ti,ab. 1 
108 ("SB 75" or SB75).ti,ab. 67 

109 gonadoliberin.ti,ab. 151 

110 kryptocur.ti,ab. 7 

111 cetrorelix.ti,ab. 520 

112 cetrotide.ti,ab. 52 

113 antagon.ti,ab. 18 

114 ganirelix.ti,ab. 160 

115 ("ORG 37462" or ORG37462).ti,ab. 3 
116 orgalutran.ti,ab. 26 

117 ("RS 26306" or RS26306).ti,ab. 5 

118 ("AY 24031" or AY24031).ti,ab. 0 

119 factrel.tiab. 13 

120 fertagyl.ti,ab. 12 

121 lutrelef.tiab. 5 

122 lutrepulse.ti,ab. 3 

123 relefact.ti,ab. 10 

124 fertiral.tiab. 0 

125 (hoe471 or "hoe 471").tiab. 6 

126 relisorm.ti,ab. 4 

127 cystorelin.ti,ab. 19 

128 dirigestran.ti,ab. 5 

129 or/33-128 47108 


ooo°o 


ooo 


Gender-affirming Surgeries 


130 virilization/ 2309 

131 (virilism or virili?ation or masculini?ation).mp. 5657 
132 feminization/ 797 

133 femini?ation.mp. 3420 

134 (vaginoplasty or vaginoplasties).mp. 1022 


135 exp Vagina/ or *Reconstructive Surgical Procedures/ 78841 
136 (vaginoplasty or vaginoplasties).mp. 1022 

137 (phalloplasty or phalloplasties).mp. 561 

138 exp Penile Prosthesis/ 1636 

139 "penile reconstruction".mp. 292 

140 (vagina reconstruction or vaginal reconstruction).mp. 549 
141 (genitoplasty or genitoplasties).mp. 263 

142 transsexualism/su [Surgery] 1007 

143 sex reassignment.mp. 1668 

144 sex transformation.mp. 42 

145 or/130-144 91560 


Systematic Review Filter 


147 meta-analysis/ 158633 

148 (meta anal* or meta-anal* or metaanal*).ti,ab. 231876 

149 ((systematic or evidence) adj2 (review* or overview*)).ti,ab. 279806 

150 ((pool* or combined) adj2 (data or trials or studies or results)).ab. 65411 
151 (search strategy or search criteria or systematic search or study selection or data 
extraction).ab. 70886 

152 (search* adj4 literature).ab. 84593 

153 or/146-152 521554 


Combine Interventions and Population 


154 32 or 129 or 145 459771 
155 21 and 154 17838 


Limit to Systematic Reviews in English Language 


156153and155 295 
157 limit 156 to english language 288 


Epistemonikos 


OVERVIEW 


Interface: https://www.epistemonikos.org/ 


Database: Epistemonikos 
Study Types: Systematic Reviews 
Search Run: April 23, 2022 


Search Strategy: search terms [number of results] 


Population 


(title:((title:(gender dysphoria) OR abstract:(gender dysphoria)) OR (title:(gender identity disorder) 
OR abstract:(gender identity disorder)) OR (title:(transgender) OR abstract:(transgender))) OR 
abstract:((title:(gender dysphoria) OR abstract:(gender dysphoria)) OR (title:(gender identity 
disorder) OR abstract:(gender identity disorder)) OR (title:(transgender) OR 
abstract:(transgender)))) 


Limit to Systematic Reviews 


*Limited by publication type “systematic review” [425] 


Canadian Agency for Drugs and Technologies in Health (CADTH) 


OVERVIEW 


Interface: https://www.cadth.ca/ 

Database: CADTH 

Study Types: Systematic Reviews, Health Technology Reviews 
Search Run: April 27, 2022 


Search Strategy: search terms [number of results] 


“gender dysphoria” [10] 
Limit to Health Technology Review [2] 


“transgender” [9] 
Limit to Health Technology Review [5} 


“gender identity disorder” [1] 


an 


Agency for Healthcare Research and Quality (AHRQ) 


Interface: https://search.ahrg.gov/ 
Database: AHRQ 


Study Types: Evidence Based Practice (EPC) Centre Reports, Full Research Reports, Health 
Technology Assessments 


Search Run: April 29, 2022 


Search Strategy: search terms [number of results] 


wn, wn, 


Search titles only: "gender identity disorder" "gender dysphoria" "transgender" [7] 


Society for Evidence-based Gender Medicine (SEGM) 


OVERVIEW 

Interface: https://segm.org/news i 
Database: SEGM News 

Study Types: Systematic Reviews 

Search Run: April 30, 2022 


Search Strategy: search terms [number of results] 


Find in page: "systematic" [5] 


National Institute for Health and Care Excellence (NICE) 


OVERVIEW 

Interface: https://www.nice.org.uk 

Database: NICE 

Study Types: Systematic Reviews, Guidelines with Systematic Reviews 
Search Run: April 30, 2022 


Search Strategy: search terms [number of results] 


gender dysphoria [1] 
Limit to Guidance [1] 


transgender [10] 
Limit to Guidance [7] 


~ 


gender identity disorder [9] 
Limit to Guidance [8] 


Search Strategies — Individual Studies 
Questions Covered: 
PICO questions: 
1. For children, adolescents, and young adults (<21) with gender dysphoria, what are the effects of 
treatment with puberty blockers (gonadotrophin releasing hormone (GnRH) analogues) 


compared to no puberty blockers? 


2. For children, adolescents, and young adults (<21) with gender dysphoria, what are the effects of 
treatment with cross-sex hormones compared to no cross-sex hormones? 


3. For children, adolescents, and young adults (<21) with gender dysphoria, what are the effects of 
gender-affirming surgeries compared to no surgery? 


Search Strategies: 


Note: Population, puberty blocker, cross-sex hormones search blocks adapted from NICE (2020) 
evidence reviews. Gender-affirming search block adapted from Wernick et al. 2019. 


Databases: Medline 


Medline 
OVERVIEW 
Interface: Ovid 
Databases: OVID Medline Epub Ahead of Print, In-Process & Other Non-Indexed Citations, 


Ovid MEDLINE(R) Daily and Ovid MEDLINE(R) 1946 to Present 
Study Types: Any 
Search Run: May 12, 2022 
Search Strategy: search terms [number of results] 


Population 


exp "Sexual and Gender Minorities"/ 12631 
Gender Dysphoria/ 781 

Gender Identity/ 20586 

Gender Role/ 204 

"Sexual and Gender Disorders"/ 81 

Transsexualism/ 4259 

Transgender Persons/ 5371 

Health Services for Transgender Persons/ 187 
exp Sex Reassignment Procedures/ 1211 

10 gender identity disorder.mp. 492 


WAN DUBWNE 


a 


11. non-binary.mp. 574 

12 transgender.mp. 10079 

13 (gender* adj3 (dysphori* or disorder* or distress or nonconform* or non-conform* or 

atypical or incongru* or identi* or disorder* or confus* or minorit* or queer* or variant or 

diverse or creative or explor* or question* or expan* or fluid)).ti,ab. 16546 

14 ((sex or gender*) adj3 (reassign* or chang* or transform* or transition*)).ti,ab. 9375 

15 (transgend* or transex* or transsex* or transfem* or transwom* or transma* or 

transmen* or transperson* or transpeopl*).ti,ab. 19788 

16 (genderfluid or genderqueer or agender).mp. 132 

17 ((correct or chosen) adj3 name).mp. 591 

18 (trans or crossgender* or cross-gender* or crossex* or cross-sex* or genderqueer*).ti,ab. 
135744 

19 (male-to-female or m2f or female-to-male or f2m).ti,ab. 149067 

20 or/1-19 341083 


Cross-sex Hormones 


21 Hormones/ad, tu,th 4690 

22 exp Progesterone/ad, tu, th 11270 

23 exp Estrogens/ad, tu, th29646 

24 exp Gonadal Steroid Hormones/ad, tu, th 35401 

25 (progesteron* or oestrogen* or estrogen*).ti,ab. 223689 

26 ((cross-sex or crosssex or gender-affirm*) and (hormon* or steroid* or therap* or 

treatment* or prescri* or pharm* or medici* or drug* or intervention* or care)).ti,ab. 
1507 

27 exp Estradiol/ad, tu, th 11200 

28 exp Testosterone/ad, tu, th 8722 

29 (testosteron* or sustanon* or tostran or testogel or testim or restandol or andriol or 

testocaps* or nebido or testavan).ti,ab. 86670 

30 (oestrad* or estrad* or evorel or ethinyloestrad* or ethinylestrad* or elleste or progynova 

or zumenon or bedol or femseven or nuvelle).ti,ab. 100411 

31 or/21-30 346508 


Puberty Blockers 


32 Gonadotropin-Releasing Hormone/ 28845 
33 (pubert* adj3 block*).ti,ab. 142 

34 ((gonadotrophin or gonadotropin) and releasing).ti,ab. 20158 
35 (GnRH adj2 analog*).ti,ab. 2879 

36 GnRH*.ti,ab. 24437 

37 "GnRH agonist*".ti,ab. 4763 

38 Triptorelin Pamoate/ 1983 

39 triptorelin.ti,ab.822 

40 arvekap.ti,ab. 1 

41 ("AY 25650" or AY25650).ti,ab. 1 

42 ("BIM 21003" or BIM21003).ti,ab. 0 

43 ("BN 52014" or BN52014).ti,ab. 0 

44 ("CL 118532" or CL118532).ti,ab. 0 


Debio.ti,ab. 119 
diphereline.ti,ab. 28 
moapar.ti,ab. 0 
pamorelin.ti,ab.1 

trelstar.tiab. 3 
triptodur.ti,ab. 1 

("WY 42422" or WY42422).ti,ab. 
("WY 42462" or WY42462).ti,ab. 


gonapeptyl.ti,ab. 0 
decapeptyl.ti,ab. 225 
salvacyl.ti,ab. 0 


Buserelin/ 2137 
buserelin.ti,ab. 1396 
onist.ti,ab. 0 


("hoe 766" or hoe-766 or hoe766).ti,ab. 72 


profact.ti,ab. 2 

receptal.ti,ab. 31 
suprecur.ti,ab. 5 
suprefact.ti,ab. 25 

tiloryth.ti,ab. 0 

histrelin.tiab. 78 
"LHRH-hydrogel implant".ti,ab. 1 
("RL 0903" or RLO903).ti,ab. 1 
("SPD 424" or SPD424).ti,ab. 1 
goserelin.ti,ab. 1017 

Goserelin/ 1644 

("ici 118630" or ici118630).ti,ab. 
("ZD-9393" or ZD9393).ti,ab. 0 
zoladex.ti,ab. 388 
leuprorelin.ti,ab. 529 
carcinil.tiab. 0 
enanton*.ti,ab. 26 

ginecrin.ti,ab. O 

leuplin.tiab. 15 

Leuprolide/ 3018 
leuprolide.ti,ab. 2003 

lucrin.ti,ab. 16 

lupron.tiab. 183 
provren.ti,ab. 0 

procrin.ti,ab. 3 


("tap 144" or tap144).ti,ab. 41 
(a-43818 or a43818).ti,ab. 3 
Trenantone.ti,ab. 2 
staladex.ti,ab. 0 


prostap.ti,ab. 6 
Nafarelin/ 327 
nafarelin.ti,ab. 263 


("76932-56-4" or "76932564").ti,ab. 


51 


93 ("76932-60-0" or "76932600").ti,ab. 0 
94 ("86220-42-0" or "86220420").tiab. 0 
95 ("rs 94991 298" or rs94991298).ti,ab. 0 
96 synarel.ti,ab. 13 

97 deslorelin.ti,ab. 310 

98 gonadorelin.ti,ab. 238 

99 ("33515-09-2" or "33515092").ti,ab. 
100 ("51952-41-1" or "51952411").ti,ab. 0 
101 ("52699-48-6" or "52699486").ti,ab. 0 
102 cetrorelix.ti,ab. 520 

103 cetrotide.ti,ab. 52 

104 ("NS 75A" or NS75A).ti,ab. 0 

105 ("NS 75B" or NS75B).ti,ab. 0 

106 ("SB 075" or $BO75).ti,ab. 1 

107 ("SB 75" or SB75).ti,ab. 67 

108 gonadoliberin.ti,ab. 152 

109 kryptocur.ti,ab. 7 

110 cetrorelix.ti,ab. 520 

111 cetrotide.ti,ab. 52 

112 antagon.ti,ab. 18 

113 ganirelix.ti,ab. 161 

114 ("ORG 37462" or ORG37462).ti,ab. 3 
115 orgalutran.ti,ab. 26 

116 ("RS 26306" or RS26306).ti,ab. 5 

117 ("AY 24031" or AY24031).ti,ab. O 

118 factrel.ti,ab. 13 

119fertagyl.ti,ab. 12 

120lutrelef.tiab. 5 

121 lutrepulse.ti,ab. 3 

122relefact.ti,ab. 10 

123fertiral.tiab. 0 

124 (hoe471 or "hoe 471").ti,ab. 6 
125relisorm.ti,ab. 4 

126 cystorelin.ti,ab. 19 

127 dirigestran.ti,ab. 5 

128 or/32-127 47179 


o 


Surgery 


129 virilization/ 2309 

130 (virilism or virili?ation or masculini?ation).mp. 5664 
131feminization/ 798 

132 femini?ation.mp. 3425 

133 (vaginoplasty or vaginoplasties).mp. 1032 

134 (vaginoplasty or vaginoplasties).mp. 1032 

135 (phalloplasty or phalloplasties).mp. 561 

136 exp Penile Prosthesis/ 1642 

137"penile reconstruction".mp. _ 292 


138 (vagina reconstruction or vaginal reconstruction).mp. 


139 (genitoplasty or genitoplasties).mp. 263 
140 transsexualism/su [Surgery] 1007 
141sex reassignment.mp. 1674 

142 sex transformation.mp. 42 

143 or/129-142 14290 


Any intervention AND population 


14431 or 128 or 143 386835 
145 20 and 144 16516 


Limit to Humans 

146 animals/ not humans/ 4972586 
147145 not 146 =: 9281 

148 limit 147 tohumans 7901 


Limit to Publication Year 2019 to Current 


149 limit 148 to yr="2019 -Current" 1859 


550 


Effects of gender affirming therapies in people with gender dysphoria: evaluation of the best available evidence. 
Dr. Romina Brignardello-Petersen and Dr. Wojtek Wiercioch; Results; May 16, 2022 


Results 


Search results and eligible reviews: After screening 647 records found through our searches, we found 
61 eligible systematic reviews. From these, 27 were published between 2020 and 2022 (Figure 1). 
Overall, 4% (1/27) of the reviews were judged to be of high methodological quality, 15% (4/27) were 
moderate methodological quality, 37% (10/27) were low methodological quality, and 44% (12/27) 
were critically low methodological quality. 


We provide reasons for excluding systematic reviews in appendix 1. 


Effects of gender affirming therapies in people with gender dysphoria: evaluation of the best available evidence. Dr. Romina Brignardello-Petersen and Dr. Wojtek 


Wiercioch; Results; May 16, 2022 


Records identified from: 
Medline (n = 288) 
Epistemonikos (n = 425) 


Records screened 
(n = 618) 


Reports sought for retrieval 
(n = 103) 


Reports assessed for eligibility 
(n= 101) 


Studies included in review 
(n = 61) 
e Puberty blockers (n = 3) 
« Cross-sex hormones (n = 22) 
e Gender-affirming surgery (n = 30) 
Multiple interventions (n = 6) 


Records removed before 
screening: 


Duplicate records removed 
(n = 95) 


Records excluded 
(n =515) 


Reports not retrieved 
(n=2) 


Reports excluded: 43 


Wrong outcomes (n = 16) 
Other surgery (e.g. voice, rib, 


face, hair, hands, etc.) (n = 13) 
Wrong study design (n = 5) 
Wrong patient population (n = 5) 
Published as abstract (n = 2) 
Wrong intervention (n = 2) 


Records identified from: 
AHRQ Qe?) screening: 
SEGM (n= 5) a records removed 
NICE (n = 16) 


Records removed before 


Records screened 
(n = 29) 


Records excluded 
(n = 18) 


Reports sought for retrieval 
(n= 11) 


Reports not retrieved 
(n=1) 


mo) assessed for eligibility eee 

Wrong population (n = 3) 
Wrong outcomes (n = 1) 
Wrong study design (n = 1) 
Non-English language (n = 2) 


Effects of gender affirming therapies in people with gender dysphoria: evaluation of the best available evidence. Dr. Romina Brignardello-Petersen and Dr. Wojtek 
Wiercioch; Results; May 16, 2022 


Figure 1: PRISMA flow diagram for the selection of systematic reviews. From: Page MJ, McKenzie JE, Bossuyt PM, Boutron |, Hoffmann TC, Mulrow CD, et al. The PRISMA 2020 statement: an 
updated guideline for reporting systematic reviews. BMJ 2021;372:n71. doi: 10.1136/bmj.n71. For more information, visit: http://www.prisma-statement.org/ 


Effects of gender affirming therapies in people with gender dysphoria: evaluation of the best available evidence. 
Dr. Romina Brignardello-Petersen and Dr. Wojtek Wiercioch; Results; May 16, 2022 


Outcomes: 


1. Puberty blockers: We found 4 systematic reviews assessing the effects of puberty blockers 
published between 2020 and 2022.1“ From these, we judged 2 as having moderate 
methodological quality, and 2 as having critically low methodological quality. Details of the 
assessment are provided in Figure 2. 


Table 1 summarizes the evidence about the effects of puberty blockers on the outcomes of 
interest. We used information from 2 systematic reviews.23 For most outcomes (except 
suicidality), there is no evidence about the effect of puberty blockers compared to not using 
puberty blockers. In other words, no studies compared the outcomes between a group of 
people with gender dysphoria using puberty blockers and another not using them. Therefore, it 
is unknown whether people with gender dysphoria who use puberty blockers experience more 
improvement in gender dysphoria, depression, anxiety, and quality of life than those with 
gender dysphoria who do not use them. There is very low certainty about the effects of puberty 
blockers on suicidal ideation (see details in Table 1). 


Studies, however, reported outcomes among a group of people with gender dysphoria after 

receiving puberty blockers. The findings are: 

- There is low certainty evidence suggesting that treatment with puberty hormones may 
slightly increase gender dysphoria severity (mean change score in the Utrecht Gender 
Dysphoria scale, 0.7 points [95% Cl, -4.2 to 5.6], range 12-60, with higher scores reflecting 
more severe gender dysphoria) 

- There is low certainty evidence suggesting that treatment with puberty blockers may 
decrease depression (mean change score in the Beck Depression Inventory, -3.4 [95% Cl, - 
5.7 to -1.0], range 0-63, with higher scores reflecting more severe depression) 

- There is low certainty evidence suggesting that treatment with puberty blockers may 
decrease anxiety (mean change score in the Trait Anxiety Scale, trait subscale, -1.5 [95% Cl, - 
4.7 to -1.8], range 0-80, with higher scores reflecting more severe anxiety) 

- There is low certainty evidence suggesting a moderate percentage of patients reporting 
adverse events after treatment with puberty blockers (see Table 1 for details) 

- There is very low certainty evidence about how puberty blockers affect suicidality 
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Figure 2: AMSTAR assessment judgements for systematic reviews addressing puberty blockers 
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Table 1: Puberty blockers (gonadotrophin releasing hormone analogues) compared to no puberty blockers in youth (<21 years old) with gender 
dysphoria 


Patient or population: youth (<21 years old) with gender dysphoria 
Intervention: puberty blockers (gonadotrophin releasing hormone analogues) 
Comparison: no puberty blockers 


Anticipated absolute effects” (95% Cl) 


Nae Certainty 
Outcomes Risk / mean with Risk / mean with ne Ane participants aoe | What happens 
no puberty puberty blockers eat (studies) a Sle 
blockers (GRADE) 
Gender dysphoria 
assessed with: difference 
The effects of puberty blockers on gender 
(effect) in gender Not reported 
dysphoria proportion or dysphoria are unknown 
severity 
Gender dysphoria 


assessed with: mean 
change score in the 
Utrecht Gender Dysphoria 
Scale (12-60, higher scores 
reflect more gender 0.7 41 ras The mean gender dysphoria score may increase 
dysphoria, 40 points or (-4.2 to 5.6) (1 study) by 0.7 points after puberty blockers 
more indicate a diagnosis 
of gender dysphoria) 
(NICE, 2020a) 
Follow up: mean 1.9 years 
(range 0.4 to 5.1 years) 


Depression 
assessed with: difference The effects of puberty blockers on depression 
(effect) in depression Net are unknown 


proportion or severity 
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Table 1: Puberty blockers (gonadotrophin releasing hormone analogues) compared to no puberty blockers in youth (<21 years old) with gender 
dysphoria 


Patient or population: youth (<21 years old) with gender dysphoria 
Intervention: puberty blockers (gonadotrophin releasing hormone analogues) 
Comparison: no puberty blockers 


Anticipated absolute effects’ (95% Cl) wf Certainty 
Outcomes Risk / mean with © Risk / mean with eee Hae participants Aan What happens 
no puberty puberty blockers 3 (studies) ic ae 
blockers 
Depression 
assessed with: mean 
change score in Beck 
®®OO 


Depression Inventory-ti -3.4 41 The mean depression score may decrease by 3.4 
scale (0-63, higher scores NA (5.7 to-1.0) (1 study) Low? pani after pub erty bh ockers ¥ 
represent more severe 
depression) (NICE, 2020a) 
Follow up: mean 1.9 years 
(range 0.4 to 5.1 years) 


Anxiety 
assessed with: difference 
(effect) in anxiety Not reported 
proportion or severity 


The effects of puberty blockers on anxiety are 
unknown 


Anxiety 
assessed with: mean 


change score in STAI-Trait 
scale (0-80, higher scores NA -1.5 NA 41 ®8OO The mean anxiety score may decrease by 1.5 


represent more severe (-4.7 to 1.8) (1 study) Low? points after puberty blockers 


anxiety) (NICE, 2020a) 
Follow up: mean 1.9 years 
(range 0.4 to 5.1 years) 
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Table 1; Puberty blockers (gonadotrophin releasing hormone analogues) compared to no puberty blockers in youth (<21 years old) with gender 
dysphoria 


Patient or population: youth (<21 years old) with gender dysphoria 
Intervention: puberty blockers (gonadotrophin releasing hormone analogues) 
Comparison: no puberty blockers 


Anticipated absolute effects” (95% Cl) 


No of Certainty 


Relative effect atticioants of the 
(95% Cl) Bae 


(studies) 


Outcomes Risk / mean with Risk / mean with 
no puberty puberty blockers 
blockers 


hat h 
evidence WUCUD BUDS 


(GRADE) 


Quality of life 
assessed with: any Not reported 
measure 


The effects of puberty blockers on quality of life 
are unknown 


Suicidal ideation 


difference (effect) in 
suicidal eta The authors report that “compared to youth who did not receive pubertal 
2020 , suppression, those who did showed lower lifetime rates 
) of suicidal ideation”. 
Follow-up: cross-sectional 


survey 


89 @®O0OO We are very uncertain about the effect of 
(1 study) VERY LOW? puberty blockers on suicidal ideation 


Adverse effects 
assessed with: proportion 
of patients reporting 11%? 7 ®®0O The proportion of patients reporting adverse 
adverse effects (NICE, NA (2% to 29%) NA (1 study) Low‘ effects after treatment with puberty blockers 
2020a) may be 11% 
Follow up: mean 2.3 years 
(range 0.0 to 11.3 years) 


STAI-Trait: Trait Anxiety Scale. Range: 0-80 
Cl: Confidence interval 
NA: Not applicable 
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Table 1; Puberty blockers (gonadotrophin releasing hormone analogues) compared to no puberty blockers in youth (<21 years old) with gender 
dysphoria 


Patient or population: youth (<21 years old) with gender dysphoria 
Intervention: puberty blockers (gonadotrophin releasing hormone analogues) 
Comparison: no puberty blockers 


Anticipated absolute effects” (95% Cl) Cota 


Nee) § 
7 Q 
Outcomes Risk / mean with Risk / mean with Belatve cuece participants DANE 


no puberty puberty blockers Bec) Se 


blockers 


(studies) (GRADE) 


] 

| 

evidence | 
| 


GRADE Working Group grades of evidence 

High certainty: We are very confident that the true effect lies close to that of the estimate of the effect 

Moderate certainty: We are moderately confident In the effect estimate: The true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different 
Low certainty: Our confidence in the effect estimate is limited: The true effect may be substantially different from the estimate of the effect 

Very low certainty: We have very little confidence in the effect estimate: The true effect is likely to be substantially different from the estimate of effect 


Explanations 


1. Mean change rated down due to risk of bias and imprecision. According to the systematic review authors, the study had poor methodological quality. In 
addition, there are too few participants included, which is not sufficient to make trustworthy inferences (does not meet the optimal information size). 

2. The authors of Rew 2020 narratively summarized the outcome of Turban et al. 2020; a cross-sectional online survey study. According to the systematic review 
authors, Turban et al. did not describe the study participants and the setting in detail and it was unclear whether outcomes were measured in a valid and 
reliable way. We therefore, downgraded the certainty of evidence by one level from low to very low due to high risk of bias. 

3. The authors reported 3/27 (11%) participants treated with GnRHa developed side effects: 1 participant developed sterile abscesses; they were switched from 
leuprolide acetate to triptorelin, 1 participant developed leg pains and headaches, which eventually resolved without treatment, 1 participant gained 19 kg 
within 9 months of initiating GnRH analogues. 

4. Proportion of adverse effects rated down due to risk of bias and imprecision. According to the systematic review authors, the cohort study Khatchadourian et 
al. 2014 was assessed at high risk of bias due to incomplete reporting of its cohort. In addition, there are too few participants included, which is not sufficient 
to make trustworthy inferences (does not meet the optimal information size). 
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2. 


Cross-sex hormones: We found 9 systematic reviews assessing the effects of cross-sex 
hormones published between 2020 and 2022.*? One of these, however, included both puberty 
blockers and cross-sex hormones combined in their evidence synthesis as was not prioritized.5 
From the 8 remaining reviews, we judged 1 as having high methodological quality, 2 as having 
moderate methodological quality, 2 as having low methodological quality, and 3 as having 
critically low methodological quality. Details of the assessment are provided in Figure 3. 
Because of its eligibility criteria related to study design, the systematic review judged at high 
methodological quality’ did not include any studies and therefore we could not use it to inform 
any outcome. 


Table 2 summarizes the evidence about the effects of cross-sex hormones on the outcomes of 
interest. We used information from 4 systematic reviews.°9 +! 12 For most outcomes (all except 
risk of breast cancer), there is no evidence about the effect of cross-sex hormones compared to 
not using cross-sex hormones. In other words, no studies compared the outcomes between a 
group of people with gender dysphoria using cross-sex hormones and another not using it. 
Therefore, it is unknown whether people with gender dysphoria who use cross-sex hormones 
experience more improvement in gender dysphoria, depression, anxiety, quality of life, and 
suicidality than those with gender dysphoria who do not use them. There is low certainty 
evidence suggesting that cross-sex hormones may not increase or decrease the risk of breast 
cancer (see details in Table 2). 


Studies, however, reported outcomes among a group of people with gender dysphoria after 

receiving cross-sex hormones. The findings are: 

- There is low certainty evidence suggesting that treatment with cross-sex hormones may 
decrease gender dysphoria severity (mean change score in the Utrecht Gender Dysphoria 
scale, -42.4 points [95% Cl, -44.1 to -40.1], range 12-60, with higher scores reflecting more 
severe gender dysphoria) 

- There is low certainty evidence suggesting that treatment with cross-sex hormones may 
decrease depression (measured with different scales, see Table 4 for details) and the need 
for treatment for depression (change in percentage, -39%) 

- There is low certainty evidence suggesting that treatment with cross-sex hormones may 
decrease anxiety (measured with different scales, see Table 4 for details) and the need for 
treatment for anxiety (change in percentage, -32%) 

- There is very low certainty about the change in quality of life after treatment with cross-sex 
hormones. 

- There is low certainty evidence suggesting that treatment with cross-sex hormones may 
decrease suicidality degree (mean change score in the Ask Suicide-Screening questions 
scale, -0.84 points [95% Cl, -1.30 to -0.44], range 0-4, with higher scores reflecting more 
severe suicidality) and the percentage of patients with need for treatment due to 
suicidality/self-harm (change in percentage, -31%). There is very low certainty evidence 
about the percentage of people with suicidal ideation and suicide attempts after treatment 
with cross-sex hormones. 
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- There is low certainty evidence suggesting a low prevalence of venous thromboembolism 
after treatment with cross-sex hormones (see Table 2 for details) 
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Figure 3: AMSTAR assessment judgements for systematic reviews addressing cross-sex hormones 
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Table 2: Cross-sex hormones compared to no cross-sex hormones in youth (<21 years old) with gender dysphoria 


Patient or population: youth (<21 years old) with gender dysphoria 
Intervention: cross-sex hormones 
Comparison: no cross-sex hormones 


Anticipated absolute effects’ (95% Cl) 
No of 


=, R f 
Outcomes Risk / mean with Risk/ mean with ne te participants Mae 
No cross-sex cross-sex hormones 6 Cl) (studies) ee pat 


hormones 


Certainty | 
| 
| 


What happens 


Gender dysphoria 
assessed with: difference 
(effect) in gender Not reported 
dysphoria percentage or 
severity 


Gender dysphoria 
assessed with: mean 
change score in the 
Utrecht Gender Dysphoria 
sag earn reo NA 42.4 NA 23 eS The mean gender dysphoria score may decrease 
dysphoria, 40 points or (-44.1 to -40.1) (1 study) by 42 points after cross-sex hormones 
more indicate a diagnosis 
of gender dysphoria) 
(NICE, 2020b) 
Follow up: 1 year 


The effects of cross-sex hormones on gender 
dysphoria are unknown 


Depression 
assessed with: difference The effects of cross-sex hormones on 
(effect) in depression Not reported depression are unknown 


percentage or severity 
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Table 2: Cross-sex hormones compared to no cross-sex hormones in youth (<21 years old) with gender dysphoria 


Patient or population: youth (<21 years old) with gender dysphoria 
Intervention: cross-sex hormones 
Comparison: no cross-sex hormones 


Anticipated absolute effects” (95% Cl) 


No at Certainty 
‘ Relative effect 5, of the 
Outcomes Risk / mean with Risk/ mean with ee ab participants re What happens 
no cross-sex cross-sex hormones a (studies) praia 
hormones ( ) 
Depression The mean depression score 
assessed with: mean reduction was 9.6 points 
change score in when using the BDI-II scale @®®OO 
depression scales (higher fia (n=23) and 7.5 when using ere 73 Low! The mean depression score may decrease after 
scores represent more the CESD-R scale (n=50). (2 studies) cross-sex hormones 
severe depression) (NICE, The authors report that 
2020b) both reductions were 
Follow up: 1 year statistically significant? 
babe i The percentage of 
assesset ; change in participants requiring 
percentage of patients WA treatment was reduced by ik 52 @®BOO ce ee Sil al iat 
with need for treatment 39% (from 54% at baseline), (1 study) Low! ater may: pa Rance after cross- 
(NICE, 2020b) which was statistically 
Follow-up: 1 year significant 
Anxiety 
assessed with: difference The effects of cross-sex hormones on anxiety 
(effect) in anxiety Not reported are unknown 
percentage or severity 
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Table 2: Cross-sex hormones compared to no cross-sex hormones in youth (<21 years old) with gender dysphoria 


Patient or population: youth (<21 years old) with gender dysphoria 
Intervention: cross-sex hormones 
Comparison: no cross-sex hormones 


Anticipated absolute effects’ (95% Cl) 


Certainty 
Relative effect Naot of the 
Outcomes Risk / mean with Risk/ mean with (95% Cl) participants | arta What happens 
no cross-sex cross-sex hormones ‘s (studies) eee 
hormones ( ) 
Anxiety The mean anxiety score 
assessed with: mean ‘even ing a, eee 
when using the In, 
ye . re in anxiety nn scale and 14.5 when using ‘n 23 ®BOO The mean anxiety score may decrease after 
es (nigner scores the STAI-Trait scale. The (1 study) Low? cross-sex hormones 
represent more severe authors report that both 
anxiety) (NICE, 2020b) reductions were statistically 
Follow up: 1 year significant 
Fries, 4 : The percentage of 
assessed with: change in participants requiring 
percentage of patients A treatment was reduced by nk 52 ®80O0 wee fas kad sade anesl si rm 
with need for treatment 32% (from 48% at baseline), (1 study) Low? " oui hormones 
(NICE, 2020b) which was statistically 


Follow-up: 1 year significant 


Quality of life 
assessed with: difference 
(effect) in quality of life 
improvement 


The effects of cross-sex hormones on quality of 


Not reported life are unknown 
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Table 2: Cross-sex hormones compared to no cross-sex hormones in youth (<21 years old) with gender dysphoria 


Patient or population: youth (<21 years old) with gender dysphoria 
Intervention: cross-sex hormones 
Comparison: no cross-sex hormones 


Anticipated absolute effects” (95% Cl) Certainty 
Ne of 
< I 
Outcomes Risk / mean with Risk/ mean with ale ak oe participants et What happens 
no cross-sex cross-sex hormones (95% yetteltst3) 5 ies 
hormones 
Quality of life 
assessed with: mean The mean qualty of Ife 
e score improved, but the 
change seni QLES-Q-SF ™ ‘ifferences were not i 50 @®OOO We are very uncertain about the quality of life 
score (higher scores statistically significant. The (1 study) VERY LOW? change after cross-sex hormones 


represent better quality of magnitudes were not 
life) (NICE, 2020b) reported 
Follow up: 1 year 


Suicide/ suicidal ideation 
assessed with: difference Not reported The effects of cross-sex hormones on suicide/ 
(effect) in suicide or ee suicidal ideation are unknown 


suicidal ideation 


Suicidality 
assessed with: change in 
score from ASQ 
instrument (higher scores -0.84 39 ®80O0 Suicidality scores may decrease by 0.84 points 
represent greater degree {-1.30 to -0.44) (1 study) Low? after cross-sex hormones 
of suicidality) (NICE, 
2020b) 
Mean follow up: 1 year 
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Table 2: Cross-sex hormones compared to no cross-sex hormones in youth (<21 years old) with gender dysphoria 


Patient or population: youth (<21 years old) with gender dysphoria 
Intervention: cross-sex hormones 
Comparison: no cross-sex hormones 


Anticipated absolute effects” (95% Cl) 


No of Certainty 
Relative effect : if th 
Outcomes Risk / mean with Risk/ mean with reese cl) i participants x be What happens 
no cross-sex cross-sex hormones Y (studies) & seer a 
hormones ( ) 
Suicidal ideation 
assessed with: percentage The percentage of 
of participants with recat aa (od 50 @®OO0O We are very uncertain about the change in 
suicidal ideation NA (from 10% at baseline). The NA (1 study) VERY LOW? percentage of patients in suicidal ideation after 
measured with PHQ-9 authors did not conduct a Grosse hormones 
(NICE, 2020b) statistical analysis 
Follow-up: 1 year 
The percentage of people 
Suicide attempts with lifetime suicide 
ae attempts was 15%, those We are very uncertain about the percentage of 
ee NA with attempts 3 months NA a aaa ®000 | people with suicide attempts after cross-sex 
reported ( ij ) before treatment was 2%, VERY LOW hormones 
Follow up: not reported and those with attempts at 
follow up was 5% 
pecs eer The percentage of 
assessed with: change in participants requiring 
The percentage of participants requiring 
percentage of patients treatment was reduced by 52 ®80OO 
with need for treatment ms 31% (from 35% at baseline), be (1 study) Low: treatment may pecan after cross- 
(NICE, 2020b) which was statistically 
Follow-up: 1 year significant 
Venous 
thromt olism Not reported The effects of cross-sex hormones on the risk of 


assessed with: Risk of VTE VTE are unknown 


Effects of gender affirming therapies in people with gender dysphoria: evaluation of the best available evidence. Dr. Romina Brignardello-Petersen and Dr. 
Wojtek Wiercioch; Results; May 16, 2022 


Table 2: Cross-sex hormones compared to no cross-sex hormones in youth (<21 years old) with gender dysphoria 


Patient or population: youth (<21 years old) with gender dysphoria 
Intervention: cross-sex hormones 
Comparison: no cross-sex hormones 


. 
Anticipated absolute effects” (95% Cl) dad Certainty 
Relat = f th 
Outcomes Risk / mean with Risk/ mean with " fess see participants o, . What happens 
no cross-sex cross-sex hormones i (studies) aoe 
hormones Me y 
Venous 
thromboembolism ®0@08O 
assessed with: Prevalence NA 20 per 1,000 NA 11,542 MODERATE? The prevalence of VTE among assigned males at 
among assigned males at (10 to 30) (18 studies) birth is probably 2% after cross-sex hormones 
birth (Totaro, 2021) 
Mean follow up: 4.1 years 
Venous 
thromboembolism ®@®SO the prevalence of VTE among assigned females 
i e pi 
assessed with: Prevalence NA 6 per 1,000 NA 4218 MODERATE® at birth is probably 0.6% after cross-sex 
among assigned females (CI not reported)® (8 studies) hormones 
at birth (Kotamarti, 2021) 
Mean follow up: 5.7 years 
Two studies compare the risk of beast cancer 
Greasticancer between assigned females at birth using versus 
assessed with: Risk of not using testosterone, and found no differences 
sraaearca (Fle eri (O vs 1 case [total n= 130], and 1 vs 6 [total wa 2,938 ®80O0 The risk of breast cancer may not increase or 
2020) ” n=1579]). A third study compared assigned (3 studies) LOW’ decrease due to the use of cross-sex hormones 


females at birth with non transgender women and 
found a lower risk in the former (magnitude not 
reported) 


Follow up: not reported 
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Table 2: Cross-sex hormones compared to no cross-sex hormones in youth (<21 years old) with gender dysphoria 


Patient or population: youth (<21 years old) with gender dysphoria 
Intervention: cross-sex hormones 
Comparison: no cross-sex hormones 


Anticipated absolute effects” (95% Cl) Certainty 


No of 
f th 
Outcomes Risk / mean with Risk/ mean with Reauye eure participants orens What happens 


evidence 
(GRADE) 


no cross-sex _cross-sex hormones |NSendie 


hormones 


(studies) 


ASQ: Ask Suicide-Screening Questions. Range 0-4 

BDI-II: Beck Depression Inventory. Range: 0-63 

CESD-R: Center for Epidemiological Studies Depression Scale, Range: 0-60 

Cl: Confidence interval 

NA: Not applicable 

PHQ-9: Patient Health Questionnaire (PHQ) Modified for Teens. For suicidal ideation, it is a single question (yes/no) 
QLES-Q-SF: Quality of Life Enjoyment and Satisfaction Questionnaire. Range: 15-75 

STAI: State-Trait Anxiety Inventory. Range: 0-80 


GRADE Working Group grades of evidence 

High certainty: We are very confident that the true effect lies close to that of the estimate of the effect 

Moderate certainty: We are moderately confident in the effect estimate: The true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different 
Low certainty: Our confidence in the effect estimate is limited: The true effect may be substantially different from the estimate of the effect 

Very low certainty: We have very little confidence in the effect estimate: The true effect Is likely to be substantially different from the estimate of effect 


Explanations 


1. Mean change rated down due to risk of bias and imprecision. According to the systematic review authors, the studies had poor methodological quality. In 
addition, there are too few participants included, which is not sufficient to make trustworthy inferences (does not meet the optimal information size) 

2. Similar results when this outcome was measured using the Patient Health Questionnaire (PHQ) Modified for Teens in one of the same studies 

3. Rated down due to risk of bias, imprecision, and indirectness. According to the systematic review authors, the studies had poor methodological quality. In 
addition, there are too few participants included, which is not sufficient to make trustworthy inferences (does not meet the optimal information size). Finally, 
30% of the participants did not have a diagnosis of gender dysphoria. 

4. Prevalence rated down due to risk of bias. According to the systematic review authors, only 6 out of the 18 studies (representing 16.5% of the weight of the 
studies) were at low risk of bias. 
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5. Ameta-analysis of independent studies reported in this systematic review suggested that the prevalence of VTE in non-transgender females at birth was 1.7% 
(based on 7 studies and 18,748 persons) 

6. Prevalence rated down doe to risk of bias. According to the systematic review authors, all studies had at least one domain judged as problematic. 

Ti 


Risk rated down 2 levels because of risk of bias. The researchers did not account for confounding in any of the studies. 
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3. Surgeries: We found 15 systematic reviews assessing the effects of gender-affirming surgeries 
published between 2020 and 2022. We judged 8 as having low methodological quality and 7 as 
having critically low methodological quality. Details of the assessment are provided in Figure 4. 
We present the results regarding the effects of surgeries in three parts. First, we describe the 
effects of all surgeries on mental health outcomes in all patients. Second, we describe the 
effects of all surgeries on surgical outcomes in assigned females at birth (transgender males). 
Finally, we describe the effects of all surgeries on surgical outcomes in assigned males at birth 
(transgender females). 


3.1 Effects of surgeries on mental health outcomes: Table 3 summarizes the evidence about the 
effects of all surgeries on mental health outcomes in all patients. We used information from 2 
systematic reviews.’ “4 There were no systematic reviews and studies reporting on gender 
dysphoria, depression, anxiety, and suicidality. Therefore, the effects of surgeries on these 
outcomes (when compared to a group of patients with gender dysphoria who do not undergo 
surgery), or the changes in these outcomes (improvements or deterioration) among patients 
who undergo surgeries is unknown. 


The systematic reviews addressed quality of life and depression, but none of the included 
studies included a comparison group. Thus, it is unknown whether people with gender 
dysphoria who undergo surgeries experience more improvement in quality of life or less regret 
than those with gender dysphoria who do not undergo surgeries. 


Studies, however, reported the following outcomes among a group of people with gender 

dysphoria after undergoing surgeries. The findings are: 

- There is low certainty evidence suggesting that the percentage of people who experience 
regret after surgery is low (1%) 

- There is very low certainty evidence about how surgeries affect quality of life (see Table 3 
for details) 
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Figure 4: AMSTAR assessment judgements for systematic reviews addressing gender-affirming surgery 


Review ID Item | Item | Item | Item | Item | Item | Item | Item | Item | Item | Item | Item | [tem | Item | Item | Item | Methodological 
1 2 3 4 5 7 8 9 10 11 12 13 14 15 16 quality 


LOW 


Bustos SS 2021 
Bustos VP 2021 


Bustos VP 2021b 
Dunford 2021 


Eftekhar, 2020, | 
Falcone 2021 


CRITICALLY LOW 


Hu, 2022 


Huayllani 2021 rT 


CRITICALLY LOW 
CRITICALLY LOW 


Jolly2021 LOW 

Nassiri 2020 eel bel CRITICALLY LOW 
Oles 2022 LOW 

Oles 2022b Low 

Salibian 2021 


CRITICALLY LOW 


Sijben 2021 
Tay 2021 


CRITICALLY LOW 


CRITICALLY LOW 


Figure legend: 
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Table 3: All surgeries compared to no surgeries in young people (<21 years old) with gender dysphoria 


Patient or population: young people (<21 years old) with gender dysphoria 
Intervention: surgeries 

Comparison: no surgeries 

Outcomes: Mental health and regret 


Anticipated absolute effects” (95% Cl) Ae Certainty 


Relative effect of the 


(studies) 


no surgery surgery (GRADE) 


| 
Outcomes Risk / mean with Risk/ mean with (95% Cl) participants At Bers What happens 


The effects of surgery on gender dysphoria, the 


Gender dysphoria 
is changes in gender dysphoria severity after 
assessed with: any Not reported surgery, and the prevalence of gender 
measure dysphoria after surgery are unknown 
Depression The effects of surgery on depression, the 
z changes in depression severity after surgery, 
assessed with: any Not reported and the prevalence of depression after surgery 
measure are unknown 
Anxiet The effects of surgery on anxiety, the changes in 
’ anxiety severity after surgery, and the 
assessed with: any Not reported prevalence of anxiety after surgery are 
measure unknown 
The effects of surgery on suicidality, the 
Suicidality rgery B 
‘ changes in anxiety severity after surgery, and 
assessed with: any Not reported the prevalence of anxiety after surgery are 
measure unknown 
Quality of life 
assessed se difference Not reported The effects of surgery on quality of life are 


(effect) in quality of life unknown 


Quality of life 
assessed with: change in Not reported The change in quality of life after surgery is 
i * unknown 
quality of life 
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Quality of life 
assessed with: mean score 
in the Short Form-36 Scale ®OOO ; 
(0-100, higher scores NA 59.17 NA 633 VERY LOW2 We are very uncertain about the quality of life 
reflect better quality of (48.59 to 69.74)? (5 studies) after surgeries 
life) (Eftekhar Ardebili, 
2020) 

Follow up: cross-sectional 


Regret 
assessed with: difference 
(effect) in percentage of 
people with regret 


Not reported The effects of surgery on regret are unknown 


Regret 


— isle ieee ia 1% ‘a 7928 @®BOO The percentage of people who experience 
of people with regret (0 to 2%)? (27 studies) Low* regret is low 
(Bustos, 2021) 


Mean follow up: 4 years 


Cl: Confidence interval 
NA: Not applicable 


GRADE Working Group grades of evidence 

High certainty: We are very confident that the true effect lies close to that of the estimate of the effect 

Moderate certainty: We are moderately confident in the effect estimate: The true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different 
Low certainty: Our confidence in the effect estimate is limited: The true effect may be substantially different from the estimate of the effect 

Very low certainty: We have very little confidence in the effect estimate: The true effect is likely to be substantially different from the estimate of effect 


Explanations 


1. Similar scores for assigned males at birth and assigned females at birth. 

2. Mean score rated down for risk of bias and inconsistency. According to the systematic review authors, all studies had concerns related to risk of bias. In 
addition, the smaller studies showed better quality of life than the larger study. 

3. Similar percentage for assigned males at birth and assigned females at birth, and for different types of surgeries (all pooled percentages below 2%). 

4. Percentage rated down due to risk of bias and indirectness. According to the authors, many of the studies had moderate or high risk of bias. The mean age of 
the participants at the time of surgery was higher than the target population. Because it was considered to not have an important effect on the pooled 
estimate, we did not rate down for statistical heterogeneity 
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3.2 Effects of surgeries on assigned females at birth: Table 4 summarizes the evidence about the 
effects of all surgeries on surgical outcomes among assigned at birth females. We used 
information from 3 systematic reviews.**” Due to the nature of the outcomes (i.e. they can 
only be experienced by people who undergo surgeries), there cannot be studies comparing 
the outcomes between a group of people with gender dysphoria who undergo surgeries and 
another who does not. 


Studies, therefore, assessed the outcomes among a group of people with gender dysphoria 
after surgery. The findings are: 
- There is low certainty evidence suggesting that the percentage of people who are satisfied 
after chest surgery is high (92%) 
- There is very low certainty evidence about the rate of surgical complications after chest 
surgery 
- There is very low certainty evidence about the percentage of people who are satisfied, and 
the rate of surgical complications after bottom surgeries (see Table 4 for details) 
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Table 4: All surgeries compared to no surgeries in assigned females at birth (<21 years old) with gender dysphoria 


Patient or population: assigned females at birth (<21 years old) with gender dysphoria 
Intervention: surgeries 
Comparison: no surgeries 


Anticipated absolute effects* (95% Cl) Wid oF Certainty 
Relative effect = of the Rath 
Outcomes Risk / mean with Risk/ mean with (95% Cl) participants evidenee What happens 
no surgery surgery (studies) (GRADE) 
Chest surgery 
Satisfaction 
assessed with: percentage @@00 
of people who reported 
92% 733 3 The percentage of people who reports being 
being sai aaa NE, NA (88% to 96%)? NA (14 studies) Low satisfied may be 92% 
Range of follow up: 6 
weeks to 46 months? 


Surgical complications 

assessed with: rate of @®O0O 
complications across NA 16.8% NA 1255 VERY LOW* We are very uncertain about the rate of surgical 
patients (Oles, 2022) Range (5.5% to 80.0%) (7 studies) complications 
Range of follow up: 8 


weeks to 1 year 


Reoperation 
assessed with: rate of 


reoperation across NA 6.2% NA 1214 ®OOO We are very uncertain about the rate of 
patients (Oles, 2022) Range (0.7% to 11.2%) (6 studies) VERY LOW* reoperation 


Range of follow up: 8 
weeks to 1 year 


Bottom surgery 
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Table 4: All surgeries compared to no surgeries in assigned females at birth (<21 years old) with gender dysphoria 


Patient or population: assigned females at birth (<21 years old) with gender dysphoria 
Intervention: surgeries 
Comparison: no surgeries 


Anticipated absolute effects” (95% Cl) ia Certainty 
Relative effect iG of the What h y 
Outcomes Risk / mean with Risk/ mean with (95% Cl) participants idence at happens 
no surgery surgery (studies) (GRADE) 
Satisfaction 
assessed with: percentage 
lars “= cing NA 89.6% NA 1458 ®O0OO We are very uncertain about the percentage of 
ns pon bd (45% to 100%) (27 studies) VERY LOW* people who reports being satisfied 
Range of follow up: 6 
weeks to 46 months 
Surgical complications- The percentage was 
“ai snd , = 2.3% (range 0 to 20%) 
assessed with: percentage experiencing total flap loss 
of people experiencing ‘ik ~ 19.5% (range 0 to 72%) ‘i 3177 @OOO We pide docreboiicarat ond ertanlingd of 
major complications (Oles, experiencing prosthesis (42 studies)* VERY LOW* complications 
2022b) Issues 
follow up: not reported ~ 24.5% (range 0 to 86%) 
experiencing urethral issues 
Surgical complications- 
Minor The percentage varied from 
assessed with: percentage poet sciatic rr @O0O We are very uncertain about the percentage of 
of people experiencing NA issues, to 24% (range 10 to NA (52 studies)? VERY Low‘ people who experience minor surgical 
major complications (Oles, 93%) experiencing urethral complications 
2022b) issues? 


follow up: not reported 
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Table 4: All surgeries compared to no surgeries in assigned females at birth (<21 years old) with gender dysphoria 


Patient or population: assigned females at birth (<21 years old) with gender dysphoria 
Intervention: surgeries 
Comparison: no surgeries 


Anticipated absolute effects” (95% Cl) Certainty 
Neo of 
a Relative effect Z of the What h 
Outcomes Risk /meanwith _ Risk/ mean with (95% Cl) Rae Per Pela: at happens 
no surgery surgery ete (GRADE) 
Reoperation 
ee ae NA js ae ik NA eae ®OOO We are ery eral about the perenne of 
patients (Oles, 2022b) ange (2.5% to 40%) (15 studies) VERY LOW* people who undergo reoperations 


follow up: not reported 


Cl: Confidence interval 
NA: Not applicable 


GRADE Working Group grades of evidence 

High certainty: We are very confident that the true effect lies close to that of the estimate of the effect 

Moderate certainty: We are moderately confident in the effect estimate: The true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different 
Low certainty: Our confidence in the effect estimate is limited: The true effect may be substantially different from the estimate of the effect 

Very low certainty: We have very little confidence in the effect estimate: The true effect is likely to be substantially different from the estimate of effect 


Explanations 


1. Studies used different scales to assess satisfaction 

2. The percentage was similar when the analysis was done by type of surgery and by follow up time (< 1 year vs 1 year or more). Another systematic review (Oles, 
2022) also investigated this outcome, and reported a very similar percentage of satisfaction (91.8%, range 73% to 100%) 

3. Percentage of patients satisfied rated down due to risk of bias and indirectness. According to the systematic review authors, several studies were judged at 
moderate and high risk of bias. In addition, the median of the mean age of patients included in the studies was 28 years 

4, Rated down due to risk of bias, inconsistency/ imprecision, and indirectness. Even though the review authors did not assess risk of bias, these studies were 
included in other systematic reviews in which the authors judged several of them at high risk of bias. The studies report inconsistent results (some high and 
other low rates). The patients are older than the target population. 

5. Results for phalloplasty. Similar results for metoidioplasty (91.3%). 
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6. People and studies for urethral complications. 2671 people (37 studies) for prosthesis issues, and 1548 people (22 studies) for total flap loss. 

7. Percentage of wound dehiscence 9.8% (range, 2.9% to 75%), percentage of infection/ partial necrosis 10.3% (range, 0 to 45.8%), percentage of prosthesis 
issues 14.2% (range, 1.6 to 41.9%), percentage of incontinence 15.3% (range, 5.4% to 59.1%) 

8. 


People and studies for infection/ partial necrosis. 2389 people (31 studies) for urethral issues, 1736 people (17 studies) for wound dehiscence, 1080 (10 
studies) for prosthesis issues, 1053 people (8 studies) for donor site issues, 131 people (3 studies) for incontinence 
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3.3 Effects of surgeries on assigned males at birth: Table 5 summarizes the evidence about 
the effects of all surgeries on surgical outcomes among assigned at birth males. We 
used information from 3 systematic reviews.1® 119 Due to the nature of the outcomes 
(i.e. they can only be experienced by people who undergo surgeries), there cannot be 
studies comparing the outcomes between a group of people with gender dysphoria 
who undergo surgeries and another who does not. 


Studies, therefore, assessed the outcomes among a group of people with gender 
dysphoria after surgery. The findings are: 

- There is low certainty evidence suggesting that the percentage of people who are 
satisfied after vaginoplasty is high (91%) 

- There is very low certainty evidence about the percentage of people who are 
satisfied, the rate of complications, and the rate of reoperations after chest surgery 
(see Table 5 for details) 

- There is low certainty evidence suggesting that the percentage of people who have 
regret after vaginoplasty is low (2%) 

- . There is very low certainty evidence about the rate of complications and the rate of 
reoperations after vaginoplasty (see Table 5 for details) 
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Table 5: All surgeries compared to no surgeries in assigned males at birth (<21 years old) with gender dysphoria 


Patient or population: assigned males at birth (<21 years old) with gender dysphoria 
Intervention: surgeries 
Comparison: no surgeries 


Anticipated absolute effects” (95% Cl) 


Relative effect No of part t Seavalsis Au 
- e) ive erec 20} Fladleler-laie) 
Outcomes Ps P the evidence What happens 


Risk / mean with no _Risk/ mean with surgery (95% Cl) (studies) (GRADE) 
surgery : 
Chest surgery 
Satisfaction 
assessed with: percentage 
of people who reported NA Range 75% (80/107) to NA ae bared ee ee 
being satisfied (Oles 2022) 95% (33/35)! (2 studies) 4 
satisfied 
Range of follow up: 12 
months to 17 years 
The complication rates 
were: 
Surgical complications = 3.8% (range 0% to 
assessed with: rate of 5.5%) of capsular e000 
complications across . pS 432 We are very uncertain about the rate of 
NA contracture NA 4 VERY LOW2 Fj 
patients (Oles 2022) (5 studies) surgical complications 
~ 2.2% of major 
Range of follow up: 2 
weeks to 16 years mematorna 
- 2.2% of implant 
extrusion? 
Reoperation 
assessed with: rate of 
reoperation across NA 8.6% NA 291 ®OOO We are very uncertain about the rate of 
patients (Oles 2022) Range (4.4% to 10.4%) (2 studies) VERY LOW? reoperation 
Range of follow up: Not 
reported 
Bottom surgery 
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Table 5: All surgeries compared to no surgeries in assigned males at birth (<21 years old) with gender dysphoria 


Patient or population: assigned males at birth (<21 years old) with gender dysphoria 
Intervention: surgeries 
Comparison: no surgeries 


Anticipated absolute effects’ (95% Cl) 


§ Certainty of | 
Outcomes Belativerettect Ne obparticipants the evidence What happens 
Risk / mean withno _Risk/ mean with surgery (95% Cl) (studies) (GRADE) 
surgery 
Satisfaction 
assessed with: percentage 
of people who reported 
being satisfied for overall ia 91% ny 1230 @eOO Bes An ae aa f ina eat ba he 
outcomes (Bustos SS, (81% to 98%)* (12 studies) Lows Bg cationed wi Hae odtcomesi may 
91% 
2021) 
Range of follow up: 1 
week to 11.3 years 
Regret 
assessed with: percentage 
of people who reported NA 2% NA 1137 ®@e0O The percentage of people who report 
regret (Bustos SS, 2021) (1% to 3%) (15 studies) Lows regret may be 2% 
Range of follow up: 2 
months to 24.1 years 
The complication rates 
were: 
Surgical complications - 1% (95% Cl, <0.1% to 
assessed with: rate of 2%) of fistula 
complications across - 11% (95% Cl, 8% to 
patients (Bustos SS, 2021) NA 14%) of stenosis and/or NA ted cnsciadl ee psf aa 
Range of follow up: 3 strictures PI 
weeks to 24.1 years - 4% (95% Cl, 1% to 9%) 
of tissue necrosis 
- 3% (95% Cl, 1% to 4%) 
of prolapse” 
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Table 5: All surgeries compared to no surgeries in assigned males at birth (<21 years old) with gender dysphoria 


Patient or population: assigned males at birth (<21 years old) with gender dysphoria 
Intervention: surgeries 
Comparison: no surgeries 


Anticipated absolute effects” (95% Cl) 


Certainty of 


Outcomes Relative effect Neof participants the evidence What happens 
a Risk/ mean withno —_Risk/ mean with surgery (95% Cl) (studies) (GRADE) “ 
surgery 
One study reported a 
Reoperation surgical revision rate of 
assessed with: rate of 9% (1/11 patients), and a 5 
reoperation across nr second study reported a 156 ®OO0O Bisbee pail 
patients (Tay, 2021) that 13% (19/145) (2 studies) VERY LOW? reoperations 
Range of follow up: 6 patients required repeat 
weeks to 14.8 months surgery due to 
complications. 


Cl: Confidence interval 
NA: Not applicable 


GRADE Working Group grades of evidence 

High certainty: We are very confident that the true effect lies close to that of the estimate of the effect 

Moderate certainty: We are moderately confident in the effect estimate: The true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially 
different 

Low certainty: Our confidence in the effect estimate is limited: The true effect may be substantially different from the estimate of the effect 

Very low certainty: We have very little confidence in the effect estimate: The true effect is likely to be substantially different from the estimate of effect 


Explanations 


1. Another systematic review, Sijben 2021, reported satisfaction from 3 additional studies: 82% (113/138) were satisfied or very satisfied, 93% (32/34) were 
happier and more satisfied with their chest, and 79% (28/36) were very satisfied with the overall cosmetic result (very low certainty of evidence due to risk of 
bias, imprecision, and indirectness). 

2. Rated down due to risk of bias, indirectness (the included studies were not restricted to youth or young adults), and imprecision (too few participants included, 
not meeting optimal information size). 
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3. Another systematic review, Sijben 2021, reported similar ranges for rates of complication requiring reoperation from 7 studies (835 patients): capsular 
contraction (range 0.0-5.6%), asymmetry (3.6%), hematoma (range 0.0-2.9%), infection (range 0.0-0.9%), striae distensae (0.7%), implant rupture (0.7%), 
abscess (0.4%), scarring (0.0%), hypersensitivity (0.0%), and numbness (0.0%) (very low certainty of evidence due to risk of bias, imprecision, and indirectness) 

4. Bustos SS et al, 2021 additionally reported on satisfaction for functional (87%, 95% Cl 77% to 94%) and aesthetic (90%, 95% Cl 84% to 94%) outcomes. Another 
systematic review and meta-analysis, Oles 2022b, similarly reported that 92.3% (range 23.1% to 100%) of patients (2410/2601) were satisfied after 
vaginoplasty (very low certainty of evidence due to risk of bias, imprecision, and indirectness). 

5. Rated down due to risk of bias (the systematic review authors reported the quality of the included studies to be low to moderate using the New Castle Ottawa 
scale), and indirectness as the included studies were not restricted to youth or young adults. We did not rate down for imprecision or inconsistency despite 
high I? values as a satisfaction rate of 80% or above was deemed as a minimum threshold for clinical importance. 

6. Rated down due to risk of bias (the systematic review authors reported the quality of the included studies to be low to moderate using the New Castle Ottawa 
scale), and indirectness as the included studies were not restricted to youth or young adults. 

7. Another systematic review, Oles 2022b, similarly reported the percentage of patients experiencing complications from 51 studies, ranging from 2.4% to 12.0% 
(range 0% to 88%) for minor complications (intraoperative injury, wound dehiscence, superficial necrosis, infection, urinary issues, vaginal prolapse, stenosis, 
and bleeding) and 1.6% to 2.1% (range 0% to 31%) for major complications (flap/graft necrosis and infection) after genitoplasty (very low certainty of evidence 
due to risk of bias, imprecision, and indirectness). 

8. Rated down due to risk of bias (the systematic review authors reported the quality of the included studies to be low to moderate using the New Castle Ottawa 
scale), imprecision and inconsistency, with wide confidence intervals and |? values ranging from 65.8% to 94.3%, and indirectness as the included studies were 
not restricted to youth or young adults. 

9. Rated down due to risk of bias, indirectness (the age range of patients in the included studies was 24 to 39 years; the studies included were restricted to those 
that investigated the use of peritoneum in neovagina construction), and imprecision (too few participants included, not meeting optimal information size). 
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Results from search for studies not included in the systematic reviews: After screening 1854 
records found through our searches, we found 10 eligible studies (figure 5). From these, 8 were 
comparative observational studies?°” and 2 were non- comparative?®”°. We provide reasons 
for excluding studies in appendix 2. 


Included 


3] 


Records identified from: 


Medline (n = 1859) 


Records screened 
(n= 1854) 


Reports sought for retrieval 
(n = 59) 


Reports assessed for eligibility 
(n= 57) 


Studies included in review 
(n = 10) 
e Puberty blockers (n = 1) 


 Cross-sex hormones (n = 4) 
e Gender-affirming surgery (n = 2) 
e Multiple interventions (n = 3) 


Records removed before screening: 
Duplicate records removed 
(n=5) 


Records excluded 
(n= 1795) 


Reports not retrieved 
(n= 2) 


Reports excluded: 47 


e Wrong patient population (n = 18) 

e Sample size <100 for non- 
comparative study (n=16) 

e Wrong study design: non- 
comparative (n = 9) 

e Wrong outcomes (n = 2) 

Wrong comparator (n = 2) 


Figure 5: PRISMA flow diagram for the selection of primary studies. From: Page MJ, McKenzie JE, Bossuyt PM, 
Boutron I, Hoffmann TC, Mulrow CD, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic 
reviews. BMJ 2021;372:n71. doi: 10.1136/bmj.n71. For more information, visit: http://www.prisma-statement.org/ 
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None of the studies were judged as likely to importantly change the conclusions obtained from 
the systematic reviews (Tables 6 and 7). The main limitations of the comparative studies were 
risk of bias concerns (Figures 6 and 7) due to confounding, classification of intervention, and 
missing data; as well as small sample sizes. Although non-comparative studies were at lower 
risk of bias, because their results were consistent with those of the included evidence, they 
were also judged as unlikely to change the conclusions importantly. 
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Table 6: Characteristics of eligible comparative observational studies 


Study ID 


VanDerMiesen, 
2020 


Becker-Hebly, 
2021 


Green, 2021 


Tordoff, 2022 


Turban, 2022 


Grannis, 2021 


Fontanari, 2020 


Sample 
size* Study design 


450 Retrospective 
cohort study 


75 Prospective 
cohort study 


3235 Cross- 
sectional study 


84 Prospective 
cohort study 


9341 Cross- 
sectional study 


47 Cross- 
sectional study 


350 Cross- 
sectional study 


Intervention 


Puberty 
blockers 


1. Puberty 
blockers 

2. Cross-sex 
hormones 
3. Surgery 


Cross-sex 
hormones 


1. Puberty 
blockers 

2. Cross-sex 
hormones 


Cross-sex 
hormones 


Cross-sex 
hormones 


1. Cross-sex 
hormones 

2. Cross-sex 
hormones or 
surgery 


Comparator 


Waiting for 
puberty blockers 


No medical 
intervention yet; 
psychosocial 
intervention 
only 


Would like to 
take cross-sex 
hormones 


No intervention 


Desired but 
never accessed 
gender affirming 
hormones 


No intervention 
yet 


1, Waiting for 
Cross-Sex 
hormones 

2. No 
intervention 


Outcomes 
measured 


Self-harm/ 
suicidality, 
internalizing 
behaviors 


Health-related 
quality of life 


Depression, 
suicidality 


Depression, 
anxiety, 
suicidal 
thoughts 
Suicidal 
ideation, 
suicidal 
attempt 
Anxiety, 
depression 
Anxiety, 
depression, 
gender distress 
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Likely to 

change 

conclusions Reasons 

No Reports a small benefit on suicidality and 
moderate on internalizing behaviours, but high 
risk of bias 

No Critical risk of bias (missing data due to low 


response rate, and confounding). Reports small 
benefit in mean change score for mental and 
physical dimension QoL as compared to no 
medical treatment. Imprecision; the 95% Cls for 
mean change scores are wide. 


No Critical risk of bias, no follow up of patients 
(measurement of current outcomes and not 
adjusting for baseline) 


No Moderate risk of bias, small sample size 


No Critical risk of bias, no follow up of patients 
(measurement of current outcomes and not 
adjusting for baseline) 


No Critical risk of bias, no follow up of patients, small 
sample size 
No Critical risk of bias (confounding, self-reported 


classification of interventions). Online cross- 
sectional survery reported small benefit in anxiety 
and depression mean scores, and little to no 
effect on gender distress with cross-sex hormones 
and/or surgery. Non-randomized comparative 
study provides very low certainty evidence due to 
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very serious risk of bias and serious imprecision 
(95% Cls include little to no effect) 

Castelo-Branco, 205 Cross- Cross-sex No intervention Anxiety, No Critical risk of bias due to confounding (non- 

2021 sectional study hormones depression adjusted analysis). Reported no difference 
observed in anxiety and depression mean scores 
(Symptom Checklist-90-Revised scale) between 
groups. Non-randomized comparative study 
provides low certainty evidence. 

*Considered the number of participants relevant to the questions of this report, not all people included in the studies 


Table 7: Characteristics of eligible non- comparative observational studies 


Likely to 
Sample Outcomes change 

Study ID size Intervention measured conclusions Reasons 

Reports rate of complications (10.5%) and satisfaction (79% totally satisfied, 20% 

mainly satisfied) within range of effects reported by studies already included in 

Surgical systematic reviews. Unlikely to reduce imprecision and inconsistency within body of 

Bordas, FtM bottom complications, evidence (3177 and 1458 people, respectively) of non-comparative studies (42 and 27, 
2021 813 surgery satisfaction No respectively) to increase certainty of evidence 

Reports rate of complications (16%) and revision surgery (5%), which is consistent with 
Elias, FtM top the rates reported in the studies included. Unlikely to increase the certainty of 
2022 110 surgery Complications No evidence 
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Figure 6: Risk of bias judgements for comparative studies 


Measurement of 
outcome 


Deviations from 
intended interventions 


Confounding Pepewmens te 


Study ID Intervention Missing data 


Becker-Hebly, 
2021 


Castelo- 
Branco, 2021 | Cross-sex hormones 


a 


Puberty blockers, cross-sex 
hormones, or surgery CRITICAL 


Fontanari, Cross-sex hormones, cross- 
2020 sex hormones or surgery 


Grannis, 2021 


Green, 2021 Cross-sex hormones CRITICAL 
Puberty blockers, cross-sex 
Tordoff, 2022 | hormones 

[rurban, 2022 | Cross-sex hormones 


Van Der 
Miesen, 2020 | Puberty blockers SERIOUS 


Cross-sex hormones 


MODERATE 
CRITICAL 


Figure legend: 
Low 
Moderate 
Serious 


cit 


Unclear 
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Figure 7: Risk of bias judgements for non-comparative studies 
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